Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Determining Usage

Each user is assigned a disk space quota, or allotment, when their account is created to insure that there will be sufficient disk space for all our users’ needs. If you need additional space, please submit your request to manager@stat.berkeley.edu.

The commands below can show you how disk space you are consuming. For additional information, consult the respective man pages.

CommandDescription
ls -lLists size of all files in the current directory.
bigfilesRecursively searches a directory for big files.
quotaReports your disk quota and current disk use.
du -h .Recursive summary of disk use for the current directory.
dustSimilar to du, but faster and more visual.
find . -lsRecursively lists all file sizes in the current directory.

Temporary Disk Space

There are several places where users can temporarily store data. These areas are often larger than home directories, but they are not backed up and may be removed without notice.

/tmp

Files put in the /tmp directory are only accessible on the machine on which they were created and are automatically wiped everytime the computer is rebooted. Files may also be deleted with little or no warning if resources become scarce. However, if you need a large amount of disk space for a short amount of time, /tmp provides a solution which does not need any staff intervention. Remember that there is no guarantee that files stored in /tmp are safe. Do not use /tmp for data that that is difficult or expensive to re-create. No special permissions are required to use /tmp. To reference your files using /tmp, use /tmp as the prefix to the name of the file, for example /tmp/myfile.

The limit to the amount of storage a user can take up is the physical limitation of the partition. However, if /tmp is full, editors, compilers, and many other programs will not work or behave erratically. To find out how much space is available in /tmp on your system, type df -k /tmp. Do not use /tmp if less than 30% of the space is available. Remove files when they are no longer needed.

/var/tmp

The /var/tmp directory functions similarly to /tmp, however, files are not automatically removed after the machine is rebooted. This directory does get erased, however, whenever the workstation needs to be reinstalled or reconfigured. Otherwise, the same policies that apply to /tmp apply to /var/tmp.

/data

The /data directory exists on some systems that have secondary disks. This directory functions similarly to /var/tmp, however, it does not get erased when the computer is reinstalled or reconfigured.

/scratch

Directories under /scratch exist on the file server and can be accessed from every machine. Unlike users’ home directories, they are not backed up, but can usually accommodate larger data files. If space becomes limited we may automatically compress files or (if time permitting) ask users to either remove or archive files that are no longer needed in order to make room for other users. Files that are not actively being used should be compressed if possible.

Compression

Infrequently accessed files may be compressed to save disk space.

See the UNIX manual pages for the above programs by using the ‘man’, for example ‘man gzip’.

CommandDescription
gunzip file.gzUncompresses file.gz to file
gzip fileCompresses file to file.gz
tar xzf file.tar.gzExtract the contents of file.tar.gz
tar czf file.tar.gz file1 [file2...]Compresses files into file.tar.gz
unzip file.zipUncompresses file.zip
zip file.zip file1 [file2...]Compresses one or more files to file.zip
zcat x.Z y.Z ...Prints the compressed file(s) to the terminal

Some applications leave behind files that may be removed without adversely affecting the program.

CommandDescription
rm fileRemoves ‘file’ if you have write permission on it.
rm -f fileRemoves ‘file’ if you have write permission in the directory containing it.
rmdir dirRemoves the empty directory ‘dir’.
rm -rf dirRecursively remove dir including every file and subdirectory. Use with caution.
strip a.out     Strips an already existing 'a.out'.
f77 -s ...      Creates a pre-stripped '.out' file when using f77.
cc -s ...       Similarly for cc.

Web browser cache files can be removed. You can clear your browser disk cache, as well as instruct the browser to set aside less disk space to use for its cache, in the program’s Preference window, usually under ‘Advanced’.

PostScript, .aux, and .log files produced by LaTeX can be recreated as necessary from the corresponding .tex file.

Compiled object files, often with the suffix .o, produced by C, C++, or Fortran compilers, can usually be deleted. If need be, they can be recreated from the original programs that produced them.

Compilers tend to create lots of big binary files such as ‘.o’ and ‘.out’ files. ‘.out’ files in particular can be quite large. If you have such files which have been unused for several days and which you don’t intend to use for several more days, they should be removed. (They can easily be recreated if you need them.) The size of ‘.out’ files which you do need can be reduced somewhat by stripping them. Type:

When programs crash, they sometimes report ‘Core dumped’ indicating that a large file called ‘core’ has been created in the program’s current working directory. A user may disable core dumps by adding “ulimit -c 0” to ~/.bashrc.

Best Practices

The Library and Research IT have a Research Data Management (RDM) program that, “helps researchers navigate the complex landscape of managing data before, during, and after their research.”

Deleting Files

To remove files and directories type: