Back to home page

OSCL-LXR

 
 

    


0001 .. SPDX-License-Identifier: GPL-2.0
0002 
0003 =====
0004 Tmpfs
0005 =====
0006 
0007 Tmpfs is a file system which keeps all of its files in virtual memory.
0008 
0009 
0010 Everything in tmpfs is temporary in the sense that no files will be
0011 created on your hard drive. If you unmount a tmpfs instance,
0012 everything stored therein is lost.
0013 
0014 tmpfs puts everything into the kernel internal caches and grows and
0015 shrinks to accommodate the files it contains and is able to swap
0016 unneeded pages out to swap space. It has maximum size limits which can
0017 be adjusted on the fly via 'mount -o remount ...'
0018 
0019 If you compare it to ramfs (which was the template to create tmpfs)
0020 you gain swapping and limit checking. Another similar thing is the RAM
0021 disk (/dev/ram*), which simulates a fixed size hard disk in physical
0022 RAM, where you have to create an ordinary filesystem on top. Ramdisks
0023 cannot swap and you do not have the possibility to resize them.
0024 
0025 Since tmpfs lives completely in the page cache and on swap, all tmpfs
0026 pages will be shown as "Shmem" in /proc/meminfo and "Shared" in
0027 free(1). Notice that these counters also include shared memory
0028 (shmem, see ipcs(1)). The most reliable way to get the count is
0029 using df(1) and du(1).
0030 
0031 tmpfs has the following uses:
0032 
0033 1) There is always a kernel internal mount which you will not see at
0034    all. This is used for shared anonymous mappings and SYSV shared
0035    memory.
0036 
0037    This mount does not depend on CONFIG_TMPFS. If CONFIG_TMPFS is not
0038    set, the user visible part of tmpfs is not built. But the internal
0039    mechanisms are always present.
0040 
0041 2) glibc 2.2 and above expects tmpfs to be mounted at /dev/shm for
0042    POSIX shared memory (shm_open, shm_unlink). Adding the following
0043    line to /etc/fstab should take care of this::
0044 
0045         tmpfs   /dev/shm        tmpfs   defaults        0 0
0046 
0047    Remember to create the directory that you intend to mount tmpfs on
0048    if necessary.
0049 
0050    This mount is _not_ needed for SYSV shared memory. The internal
0051    mount is used for that. (In the 2.3 kernel versions it was
0052    necessary to mount the predecessor of tmpfs (shm fs) to use SYSV
0053    shared memory.)
0054 
0055 3) Some people (including me) find it very convenient to mount it
0056    e.g. on /tmp and /var/tmp and have a big swap partition. And now
0057    loop mounts of tmpfs files do work, so mkinitrd shipped by most
0058    distributions should succeed with a tmpfs /tmp.
0059 
0060 4) And probably a lot more I do not know about :-)
0061 
0062 
0063 tmpfs has three mount options for sizing:
0064 
0065 =========  ============================================================
0066 size       The limit of allocated bytes for this tmpfs instance. The
0067            default is half of your physical RAM without swap. If you
0068            oversize your tmpfs instances the machine will deadlock
0069            since the OOM handler will not be able to free that memory.
0070 nr_blocks  The same as size, but in blocks of PAGE_SIZE.
0071 nr_inodes  The maximum number of inodes for this instance. The default
0072            is half of the number of your physical RAM pages, or (on a
0073            machine with highmem) the number of lowmem RAM pages,
0074            whichever is the lower.
0075 =========  ============================================================
0076 
0077 These parameters accept a suffix k, m or g for kilo, mega and giga and
0078 can be changed on remount.  The size parameter also accepts a suffix %
0079 to limit this tmpfs instance to that percentage of your physical RAM:
0080 the default, when neither size nor nr_blocks is specified, is size=50%
0081 
0082 If nr_blocks=0 (or size=0), blocks will not be limited in that instance;
0083 if nr_inodes=0, inodes will not be limited.  It is generally unwise to
0084 mount with such options, since it allows any user with write access to
0085 use up all the memory on the machine; but enhances the scalability of
0086 that instance in a system with many CPUs making intensive use of it.
0087 
0088 
0089 tmpfs has a mount option to set the NUMA memory allocation policy for
0090 all files in that instance (if CONFIG_NUMA is enabled) - which can be
0091 adjusted on the fly via 'mount -o remount ...'
0092 
0093 ======================== ==============================================
0094 mpol=default             use the process allocation policy
0095                          (see set_mempolicy(2))
0096 mpol=prefer:Node         prefers to allocate memory from the given Node
0097 mpol=bind:NodeList       allocates memory only from nodes in NodeList
0098 mpol=interleave          prefers to allocate from each node in turn
0099 mpol=interleave:NodeList allocates from each node of NodeList in turn
0100 mpol=local               prefers to allocate memory from the local node
0101 ======================== ==============================================
0102 
0103 NodeList format is a comma-separated list of decimal numbers and ranges,
0104 a range being two hyphen-separated decimal numbers, the smallest and
0105 largest node numbers in the range.  For example, mpol=bind:0-3,5,7,9-15
0106 
0107 A memory policy with a valid NodeList will be saved, as specified, for
0108 use at file creation time.  When a task allocates a file in the file
0109 system, the mount option memory policy will be applied with a NodeList,
0110 if any, modified by the calling task's cpuset constraints
0111 [See Documentation/admin-guide/cgroup-v1/cpusets.rst] and any optional flags,
0112 listed below.  If the resulting NodeLists is the empty set, the effective
0113 memory policy for the file will revert to "default" policy.
0114 
0115 NUMA memory allocation policies have optional flags that can be used in
0116 conjunction with their modes.  These optional flags can be specified
0117 when tmpfs is mounted by appending them to the mode before the NodeList.
0118 See Documentation/admin-guide/mm/numa_memory_policy.rst for a list of
0119 all available memory allocation policy mode flags and their effect on
0120 memory policy.
0121 
0122 ::
0123 
0124         =static         is equivalent to        MPOL_F_STATIC_NODES
0125         =relative       is equivalent to        MPOL_F_RELATIVE_NODES
0126 
0127 For example, mpol=bind=static:NodeList, is the equivalent of an
0128 allocation policy of MPOL_BIND | MPOL_F_STATIC_NODES.
0129 
0130 Note that trying to mount a tmpfs with an mpol option will fail if the
0131 running kernel does not support NUMA; and will fail if its nodelist
0132 specifies a node which is not online.  If your system relies on that
0133 tmpfs being mounted, but from time to time runs a kernel built without
0134 NUMA capability (perhaps a safe recovery kernel), or with fewer nodes
0135 online, then it is advisable to omit the mpol option from automatic
0136 mount options.  It can be added later, when the tmpfs is already mounted
0137 on MountPoint, by 'mount -o remount,mpol=Policy:NodeList MountPoint'.
0138 
0139 
0140 To specify the initial root directory you can use the following mount
0141 options:
0142 
0143 ====    ==================================
0144 mode    The permissions as an octal number
0145 uid     The user id
0146 gid     The group id
0147 ====    ==================================
0148 
0149 These options do not have any effect on remount. You can change these
0150 parameters with chmod(1), chown(1) and chgrp(1) on a mounted filesystem.
0151 
0152 
0153 tmpfs has a mount option to select whether it will wrap at 32- or 64-bit inode
0154 numbers:
0155 
0156 =======   ========================
0157 inode64   Use 64-bit inode numbers
0158 inode32   Use 32-bit inode numbers
0159 =======   ========================
0160 
0161 On a 32-bit kernel, inode32 is implicit, and inode64 is refused at mount time.
0162 On a 64-bit kernel, CONFIG_TMPFS_INODE64 sets the default.  inode64 avoids the
0163 possibility of multiple files with the same inode number on a single device;
0164 but risks glibc failing with EOVERFLOW once 33-bit inode numbers are reached -
0165 if a long-lived tmpfs is accessed by 32-bit applications so ancient that
0166 opening a file larger than 2GiB fails with EINVAL.
0167 
0168 
0169 So 'mount -t tmpfs -o size=10G,nr_inodes=10k,mode=700 tmpfs /mytmpfs'
0170 will give you tmpfs instance on /mytmpfs which can allocate 10GB
0171 RAM/SWAP in 10240 inodes and it is only accessible by root.
0172 
0173 
0174 :Author:
0175    Christoph Rohland <cr@sap.com>, 1.12.01
0176 :Updated:
0177    Hugh Dickins, 4 June 2007
0178 :Updated:
0179    KOSAKI Motohiro, 16 Mar 2010
0180 :Updated:
0181    Chris Down, 13 July 2020