FAQ: Is the data in the cloud cluster backed up? Answer: No. The data in the cluster is not backed up. You should make your own copies of the data in order to ...
FAQ: What is the "distributed cache" feature provided by Hadoop, and how can my application use it? Answer: Some jobs require each Map task to read in one or mor...
FAQ: How do I transfer large files between the Cloud Cluster HDFS and a host on campus that is outside the cluster? Answer: Transferring into HDFS: To copy ...
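One possible way to move a large file into HDFS without staging it on the login node is to stream it over SSH. The sketch below is not taken from the FAQ answer, which is truncated above. It assumes the login host named in the logon FAQ, that `hadoop` is on the PATH of a non-interactive SSH session, and a hypothetical helper name `stream_to_hdfs_cmd`. The helper only prints the command so it can be reviewed before running.

```shell
#!/bin/sh
# Hedged sketch: print (do not run) a command that streams a local file
# into HDFS through the login node.
# Assumption: LOGIN_NODE defaults to the host named in the logon FAQ.
LOGIN_NODE="${LOGIN_NODE:-shell.disc.pdl.cmu.local}"

# stream_to_hdfs_cmd is a hypothetical helper, not a Hadoop or cluster tool.
# Usage: stream_to_hdfs_cmd <local-file> <hdfs-destination-path>
stream_to_hdfs_cmd() {
    src="$1"
    dst="$2"
    # `hadoop fs -put - <dst>` reads the file contents from standard input,
    # so the data is piped through ssh instead of being copied twice.
    printf 'ssh %s "hadoop fs -put - %s" < %s\n' "$LOGIN_NODE" "$dst" "$src"
}
```

For example, `stream_to_hdfs_cmd big.dat /user/me/big.dat` prints a single `ssh` command line. The paths here are placeholders.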
FAQ: How do I access the status page for the jobs? Answer: * First, configure your browser to access the cluster through the proxy server. See CloudFaqBrowserP...
FAQ: How do I log on to the cluster? Answer: Use SSH to initiate a session to the login node for the cloud cluster: ssh shell.disc.pdl.cmu.local From the login node ...
FAQ: How do I check the quota for my allocated storage? Answer: HDFS Storage: To check the quota for your home directory use dfs repquota as follows. Note: Raw ...
FAQ: What happens when I exceed the allocated storage space? Answer: The answer depends on the type of storage being used. Home Directory: For your home directory...
FAQ: How do I configure memory parameters for my mapreduce jobs? Answer: Memory Parameters: There are several memory parameters configurable for users: * map...
FAQ: How do I change my password (or login shell)? Answer: Use the form at https://alpha.pdl.cmu.edu/~account/cgi-bin/chpw and change the password for your "...
FAQ: How do I submit Hadoop jobs to the Cloud Cluster? Answer: Run the following command in a Linux shell on the login node: hadoop jar YourJar.jar YourClass CommandLi...
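The submission command above can be wrapped in a small shell function. This is a minimal sketch, assuming a hypothetical helper name `submit_job` and a `DRY_RUN` switch that are not part of Hadoop or the cluster setup. By default it only prints the `hadoop jar` command it would run.

```shell
#!/bin/sh
# Hedged sketch: assemble a `hadoop jar` submission from its parts.
# submit_job and DRY_RUN are hypothetical names used for illustration only.
# Usage: submit_job <jar> <main-class> [job arguments...]
submit_job() {
    jar="$1"
    class="$2"
    shift 2
    if [ "${DRY_RUN:-1}" = "1" ]; then
        # Default: show the command without contacting the cluster.
        echo "hadoop jar $jar $class $*"
    else
        # DRY_RUN=0: actually submit; requires `hadoop` on the PATH.
        hadoop jar "$jar" "$class" "$@"
    fi
}
```

Calling `submit_job YourJar.jar YourClass input output` prints the command. Setting `DRY_RUN=0` before the call submits the job instead.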
CMU Cloud Computer Cluster at the Parallel Data Laboratory The OpenCloud cluster was in service from 2009 through 2018. * PDL Cloud cluster overview covers t...