summaryrefslogtreecommitdiffstats
path: root/doc/mbox.man
blob: 48d7110762355b659f31b090d5e4466242a69ba6 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
'\" t
.\" -*-nroff-*-
.\"
.\"
.\"     Copyright (C) 2000 Thomas Roessler <roessler@does-not-exist.org>
.\" 
.\"	This document is in the public domain and may be distributed and
.\"	changed arbitrarily.
.\"
.TH mbox 5 "August 2000" Unix "User Manuals"
.SH NAME
.PP
mbox \- Format for mail message storage.
.SH DESCRIPTION
.PP
This document describes the format traditionally used by Unix hosts
to store mail messages locally.  
.B mbox
files typically reside in the system's mail spool, under various
names in users'
.B Mail
directories, and under the name
.B mbox
in users' home directories.
.PP
An 
.B mbox
is a text file containing an arbitrary number of e-mail messages.
Each message consists of a postmark, followed by an e-mail message
formatted according to RFC 822.  The file format is line-oriented.
Lines are separated by line feed characters (ASCII 10).
.PP
A postmark line consists of the four characters "From", followed by
a space character, followed by the message's envelope sender
address, followed by whitespace, and followed by a time stamp. The
sender address is expected to be an
.B addrspec
as defined in appendix D of RFC 822.
.PP
The date is expected to be formatted according to the following
syntax (represented in the augmented Backus-Naur formalism used by
RFC 822):
.PP
.TS
lll.
mbox-date	 =	weekday month day time [ timezone ] year
weekday	 =	"Mon" / "Tue" / "Wed" / "Thu" / "Fri" 
		/ "Sat" / "Sun"
month	 =	"Jan" / "Feb" / "Mar" / "Apr" / "May"
		/ "Jun" / "Jul" / "Aug" / "Sep" 
		/ "Oct" / "Nov" / "Dec"
day	 =	1*2DIGIT
time	 =	1*2DIGIT ":" 1*2DIGIT [ ":" 1*2DIGIT ]
timezone	 =	( "+" / "-" ) 4DIGIT
year	 =	( 4DIGIT / 2DIGIT )
.TE
.PP
For compatibility reasons with legacy software, two-digit years
greater than or equal to 70 should be interpreted as the years
1970+, while two-digit years less than 70 should be interpreted as
the years 2000-2069.
.PP
Software reading files in this format should also be prepared to
accept non-numeric timezone information such as "CET DST" for
Central European Time, dailight saving time.
.PP
Example:
.IP "" 1
From roessler@does-not-exist.org Fri Jun 23 02:56:55 2000
.PP
In order to avoid mis-interpretation of lines in message bodies
which begin with the four characters "From", followed by a space
character, the character ">" is commonly prepended in front of such
lines.
.SH
LOCKING
.PP
Since 
.B mbox
files are frequently accessed by multiple programs in parallel, 
.B mbox
files should generally not be accessed without locking.
.PP
Three different locking mechanisms (and combinations thereof) are in
general use:
.IP "\(bu"
.BR fcntl (2) 
locking is mostly used on recent, POSIX-compliant systems.  Use of
this locking method is, in particular, advisable if 
.B mbox
files are accessed through the Network File System (NFS), since it
seems the only way to reliably invalidate NFS clients' caches.
.IP "\(bu"
.BR flock (2)
locking is mostly used on BSD-based systems.
.IP "\(bu"
Dotlocking is used on all kinds of systems.  In order to lock an
.B mbox
file named
.IR folder ,
an application first creates a temporary file with a unique
name in the directory in which the 
.I folder
resides.  The application then tries to use the
.BR link (2)
system call to create a hard link named
.IB folder .lock
to the temporary file.  The success of the
.BR link (2)
system call should be additionally verified using
.BR stat (2)
calls.  If the link has succeeded, the mail folder is considered
dotlocked.  The temporary file can then safely be unlinked.
.IP ""
In order to release the lock, an application just unlinks the
.IB folder .lock
file.
.PP
If multiple methods are combined, implementors should make sure to
use the non-blocking variants of the
.BR fcntl (2)
and 
.BR flock (2)
sytem calls in order to avoid deadlocks.
.PP
If multiple methods are combined, an
.B mbox
file must not be considered to have been successfully locked before
all individual locks were obtained.  When one of the individual
locking methods fails, an application should release all locks it
acquired successfully, and restart the entire locking procedure from
the beginning, after a suitable delay.
.PP
The locking mechanism used on a particular system is a matter of
local policy, and should be consistently used by all applications
installed on the system which access 
.B mbox
files. Failure to do so may result in loss of e-mail data, and in
corrupted
.B mbox
files.
.SH
FILES
.IP "/var/spool/mail/\fIuser\fP"
.IR user 's
incoming mail folder.
.IP "~\fIuser\fP/mbox"
.IR user 's
archived mail messages, in his home directory.
.IP "~\fIuser\fP/Mail/"
A directory in
.IR user 's
home directory which is commonly used to hold 
.B mbox
format folders.
.SH
SEE ALSO
.PP
.BR elm (1),
.BR fcntl (2), 
.BR flock (2), 
.BR link (2),
.BR local (8), 
.BR mail (1), 
.BR maildir (5), 
.BR mail.local (8), 
.BR mutt (1),
.BR mutt_dotlock (1), 
.BR pine (1),
.BR procmail (1),
.BR sendmail (8)
.PP
D. Crocker, Standard for the format of ARPA Internet text messages,
RFC 822
.PP
M. R. Horton, UUCP mail interchange format standard, RFC 976
.SH
AUTHOR
.PP
The present document was written by Thomas Roessler
<roessler@does-not-exist.org>. 
.SH
HISTORY
.PP
The
.B mbox
format occured in Version 6 AT&T Unix.
.PP
A variant of this format was documented in RFC 976.