1. fprof(8)
  2. fprof(8)

NAME

fprof - File System Profiler

SYNOPSIS

fprof target_directory

mpirun -np 8 fprof ...

DESCRIPTION

fprof is a lightweight profiler designed to provide a set of useful statistical characterization of the target file system at extreme scale. A sample report on ORNL Atlas file system is as the following:

Fileset histograms:

        <  4.00 KiB       19888046           17.08%    ∎∎∎∎∎∎∎∎
        <  8.00 KiB       5708765             4.90%    ∎∎
        <  16.00 KiB      4376361             3.76%    ∎
        <  32.00 KiB      5366888             4.61%    ∎∎
        <  64.00 KiB      7061464             6.07%    ∎∎∎
        <  256.00 KiB     13388304           11.50%    ∎∎∎∎∎
        <  512.00 KiB     8406809             7.22%    ∎∎∎
        <  1.00 MiB       3652496             3.14%    ∎
        <  4.00 MiB       13817394           11.87%    ∎∎∎∎∎
        <  16.00 MiB      13088330           11.24%    ∎∎∎∎∎
        <  512.00 MiB     18992735           16.32%    ∎∎∎∎∎∎∎∎
        <  1.00 GiB       1918471             1.65%
        <  100.00 GiB     740765              0.64%
        <  256.00 GiB     2779                0.00%
        <  512.00 GiB     630                 0.00%
        <  1.00 TiB       699                 0.00%
        >  1.00 TiB       458                 0.00%

Fprof epilogue:

        Directory count:         35,945,074
        Sym Links count:         1,050,485
        File count:              116,411,394
        Skipped count:           364
        Total file size:         8160.08 TiB
        Avg file size:           73.50 MiB
        Max files within dir:    1,003,319

fprof supports two other options:

An sample output of --gpfs-block-alloc is as the following, against a dataset with mostly small files. The report shows that if GPFS blocksize is 8M, then the 1GB dataset will inflate 10x to 10.39GB, with a poor space efficiency of 10%. However, this trend should not generalize to other file systems. We observe that for file system with many large files, this space inefficiency can be largely neglected.

GPFS Block Alloc Report:

        Subblocks: [162887  90033  64419  45701  42551]

        Blocksize: 256k     Estimated Space: 1.24 GiB               Efficiency:    86%
        Blocksize: 512k     Estimated Space: 1.37 GiB               Efficiency:    78%
        Blocksize: b1m      Estimated Space: 1.97 GiB               Efficiency:    55%
        Blocksize: b4m      Estimated Space: 5.58 GiB               Efficiency:    19%
        Blocksize: b8m      Estimated Space: 10.39 GiB              Efficiency:    10%

AUTHOR

Feiyi Wang (fwang2@ornl.gov)

  1. December 2015
  2. fprof(8)