Linux Tricks Find Sparse Files Filesystems-Ext And Xfs
find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}' | xargs ls -lsh+
The command above will quickly catch all sparse files “bigger” J than one Gigabyte, but WILL NOT cross all filesystems ( all network and mounted filesystems will be excluded from the search e.g /oracle /usr/sap etc…) If you want to change the size, please replace the “+1G” with the desired size. For example +10G will return all files bigger than 10 Gigabytes
- A very fast find, using xdev – finds only two files – lastlog and faillog :
rdeluh0001050:/var/tmp # find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}'
/var/log/faillog
/var/log/lastlog
man find -printf
->%p File's name.
->%S File's sparseness. This is calculated as (BLOCKSIZE*st_blocks / st_size). The exact value you will get for an ordinary file of a certain length is system-dependent. However, normally sparse files will have values less than 1.0, and files which use indirect blocks may have a value which is greater than 1.0. The value used for BLOCKSIZE is system-dependent, but is usually 512 bytes. If the file size is zero, the value printed is undefined. On systems which lack support for st_blocks, a file's sparseness is assumed to be 1.0.
- We execute ls –lsh and see the difference (DP is processing the file size in red ):
rdeluh0001050:/var/tmp # find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}' | xargs ls -lsh
36K -rw------- 1 root root 30G Jul 25 18:05 /var/log/faillog
40K -rw-r--r-- 1 root tty 273G Dec 22 17:48 /var/log/lastlog
- I create a 2GB sparse file with actual size 0 bytes under /var/tmp/ , named kalintest.sparse :
rdeluh0001050:/var/tmp # dd if=/dev/zero of=kalintest.sparse bs=1 count=0 seek=2G
0+0 records in
0+0 records out
0 bytes (0 B) copied, 6.385e-06 s, 0.0 kB/s
- Running the find command again finds it, alongside with the other files :
rdeluh0001050:/var/tmp # find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}' | xargs ls -lsh
36K -rw------- 1 root root 30G Jul 25 18:05 /var/log/faillog
40K -rw-r--r-- 1 root tty 273G Dec 22 17:48 /var/log/lastlog
0 -rw-r--r-- 1 root root 2.0G Dec 22 18:29 /var/tmp/kalintest.sparse
- I create another file under /var/tmp the same way, this time named kalnitest.sparse2 :
rdeluh0001050:/var/tmp # dd if=/dev/zero of=kalintest.sparse2 bs=1 count=0 seek=2G
0+0 records in
0+0 records out
0 bytes (0 B) copied, 6.936e-06 s, 0.0 kB/s
- The file is found again, using the same command :
rdeluh0001050:/var/tmp # find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}' | xargs ls -lsh
36K -rw------- 1 root root 30G Jul 25 18:05 /var/log/faillog
40K -rw-r--r-- 1 root tty 273G Dec 22 17:48 /var/log/lastlog
0 -rw-r--r-- 1 root root 2.0G Dec 22 18:29 /var/tmp/kalintest.sparse
0 -rw-r--r-- 1 root root 2.0G Dec 22 18:30 /var/tmp/kalintest.sparse2
- I delete both files, running the find again returns the desired results :
rdeluh0001050:/var/tmp # rm /var/tmp/kalintest.sparse*
rdeluh0001050:/var/tmp # find / -xdev -type f -size +1G -printf "%S\t%p\n" 2>/dev/null | gawk '{if ($1 < 1.0) print $2}' | xargs ls -lsh
36K -rw------- 1 root root 30G Jul 25 18:05 /var/log/faillog
40K -rw-r--r-- 1 root tty 273G Dec 22 17:48 /var/log/lastlog
rdeluh0001050:/var/tmp #