I learned about /proc/sys/fs/file-nr and /proc/sys/fs/file-max
earlier tonight.
I was sending some mail, cc'd myself and was running fetchmail to
watch for replies. (Sure enough, the person I wanted to correspond
with was working late too...) I noticed that even though fetchmail
said it was fetching some mail (my cc), it wasn't being delivered via
sendmail (which fetchmail invokes).
Diagnosing this problem simply consisted of reading /var/log/maillog.
(Apparently I had had a number of such incidents throughout the day.)
The mail log told me that the system had run out of open files. The
exact message was:
Too many open files in system
This led me to wonder about how one determines how many open files
you are allowed to have and how you find out how many your system
is using... (remember, that we're *not* talking about per-process
limits here, but rather system limits.)
I chose to use the kernel sources as my documentation. ;-)
In fs/file_table.c, I noticed the following declaration early
in the file:
int max_files = NR_FILE;/* tunable */
NR_FILES is defined to be 4096 in include/linux/fs.h. If you
get close to the limit, a user process won't be allowed to open
a new file if it gets within NR_RESERVED_FILES (10) of the limit.
In addition to declaring/defining `max_files', file_table.c also
declares `nr_files' and `nr_free_files'.
`nr_files' is the total number of file structs that the kernel
has allocated thus far. This value is not allowed to exceed
`max_files'. `nr_free_files' is self descriptive, it simply
tells the kernel how many (of the `nr_files' that are allocated)
are free.
Searching for `max_files' in the kernel sources turned out an
occurence in a data structure in sysctl.c which controls how part of
the structure of the /proc file system gets set up.
It turns out that /proc/sys/fs/file-max contains the value of
`max_files'. You may set this value via, e.g,
echo 5200 >/proc/sys/fs/file-max
The file /proc/sys/fs/file-nr is also of interest. It is read-only,
and contains three values (on one line). These are as follows:
- the total number of file structs allocated by the system,
i.e, the value given by `nr_files' in the kernel sources.
- the number of free file structs, i.e, the value given by
`nr_free_files'.
- the maximum number of file structs which may be allocated
by the system. This will be the value of `max_files' in
the kernel sources and is also available by examining
/proc/sys/fs/file-max.
So, for example, after my fiasco with my mail not getting delivered
(which also entailed me frantically closing down a lot of applications
that I wasn't using), I had the following situation:
ocotillo:linux$ cat /proc/sys/fs/file-nr
4096 1323 4096
ocotillo:linux$ cat /proc/sys/fs/file-max
4096
Note that the first and third number being equal indicates that I
actually hit the limit. (Actually, if these numbers even get close,
you're probably in trouble - remember that non-root processes will
fail when the system gets within 10 files of the maximum.)
Then, as root, I did the following:
[root@ocotillo kev]# echo 4608 >/proc/sys/fs/file-max
Running the same two `cat' commands now yields the following
results:
ocotillo:linux$ cat /proc/sys/fs/file-nr
4096 1323 4608
ocotillo:linux$ cat /proc/sys/fs/file-max
4608
Hopefully, this will give me a bit of a cushion for the days to
come.
I was curious to see how many file structs the kernel was using on
a different (less heavily used) system. Here's what I saw:
saguaro:kev$ cat /proc/sys/fs/file-nr
1969 461 4096
saguaro:kev$ cat /proc/sys/fs/file-max
4096
Comparing the first and third numbers indicates that this other system
had come no where close to hitting the limit of 4096.
I have a hunch that those of you who run big web servers and other
sorts of things which may need lots of open files already knew how to
increase the number of maximum number of open files on your system.
But even if you did, I hope that my description of the methodology
that I used to ferret the information out of the kernel was of some
interest.
Kevin
From Anthony Walsh" <
awalsh@itconnections.com Fri Sep 1 15:23:38 2000
From: Anthony Walsh" <
awalsh@itconnections.com (Anthony Walsh)
Date: Fri, 1 Sep 2000 08:23:38 -0700
Subject: LOOK AT THIS!
Message-ID: <
002201c01428$aeeba4e0$061410ac@itconnections.com>
Hey Everyone,
I just found out this morning that the priority on one of my job
openings just became urgent! I need to find THE BEST person for
Linux/Solaris Administrator. This is a perm job with a company that you
cannot pass up on. If you are interested you need to give me a call by
11:30 today. Please email me a resume at awalsh@itconnections.com in a word
format.
Thanks,
Anthony G. Walsh
Account Manager awalsh@itconnections.com
IT Connections, Inc. 602.240.7711 Ext. 14
Your Source For Efficient, Effective IT Expertise.