summaryrefslogtreecommitdiffstats
path: root/doc/cross-compilation.xml
blob: 0b2b85aeeef61118726cf01bdabfd7d805bd33ac (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
<chapter xmlns="http://docbook.org/ns/docbook"
         xmlns:xlink="http://www.w3.org/1999/xlink"
         xml:id="chap-cross">

<title>Cross-compilation</title>

<section xml:id="sec-cross-intro">
  <title>Introduction</title>
  <para>
    "Cross-compilation" means compiling a program on one machine for another type of machine.
    For example, a typical use of cross compilation is to compile programs for embedded devices.
    These devices often don't have the computing power and memory to compile their own programs.
    One might think that cross-compilation is a fairly niche concern, but there are advantages to being rigorous about distinguishing build-time vs run-time environments even when one is developing and deploying on the same machine.
    Nixpkgs is increasingly adopting the opinion that packages should be written with cross-compilation in mind, and nixpkgs should evaluate in a similar way (by minimizing cross-compilation-specific special cases) whether or not one is cross-compiling.
  </para>

  <para>
    This chapter will be organized in three parts.
    First, it will describe the basics of how to package software in a way that supports cross-compilation.
    Second, it will describe how to use Nixpkgs when cross-compiling.
    Third, it will describe the internal infrastructure supporting cross-compilation.
  </para>
</section>

<!--============================================================-->

<section xml:id="sec-cross-packaging">
  <title>Packaging in a cross-friendly manner</title>

  <section>
    <title>Platform parameters</title>
    <para>
      Nixpkgs follows the <link xlink:href="https://gcc.gnu.org/onlinedocs/gccint/Configure-Terms.html">common historical convention of GNU autoconf</link> of distinguishing between 3 types of platform: <wordasword>build</wordasword>, <wordasword>host</wordasword>, and <wordasword>target</wordasword>.

      In summary, <wordasword>build</wordasword> is the platform on which a package is being built, <wordasword>host</wordasword> is the platform on which it is to run. The third attribute, <wordasword>target</wordasword>, is relevant only for certain specific compilers and build tools.
    </para>

    <para>
      In Nixpkgs, these three platforms are defined as attribute sets under the names <literal>buildPlatform</literal>, <literal>hostPlatform</literal>, and <literal>targetPlatform</literal>.
      All three are always defined as attributes in the standard environment, and at the top level. That means one can get at them just like a dependency in a function that is imported with <literal>callPackage</literal>:
      <programlisting>{ stdenv, buildPlatform, hostPlatform, fooDep, barDep, .. }: ...buildPlatform...</programlisting>, or just off <varname>stdenv</varname>:
      <programlisting>{ stdenv, fooDep, barDep, .. }: ...stdenv.buildPlatform...</programlisting>.
    </para>
    <variablelist>
      <varlistentry>
        <term><varname>buildPlatform</varname></term>
        <listitem><para>
          The "build platform" is the platform on which a package is built.
          Once someone has a built package, or pre-built binary package, the build platform should not matter and be safe to ignore.
        </para></listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>hostPlatform</varname></term>
        <listitem><para>
          The "host platform" is the platform on which a package will be run.
          This is the simplest platform to understand, but also the one with the worst name.
        </para></listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>targetPlatform</varname></term>
        <listitem>
          <para>
            The "target platform" attribute is, unlike the other two attributes, not actually fundamental to the process of building software.
            Instead, it is only relevant for compatibility with building certain specific compilers and build tools.
            It can be safely ignored for all other packages.
          </para>
          <para>
            The build process of certain compilers is written in such a way that the compiler resulting from a single build can itself only produce binaries for a single platform.
            The task specifying this single "target platform" is thus pushed to build time of the compiler.
            The root cause of this mistake is often that the compiler (which will be run on the host) and the the standard library/runtime (which will be run on the target) are built by a single build process.
          </para>
          <para>
            There is no fundamental need to think about a single target ahead of time like this.
            If the tool supports modular or pluggable backends, both the need to specify the target at build time and the constraint of having only a single target disappear.
            An example of such a tool is LLVM.
          </para>
          <para>
            Although the existence of a "target platfom" is arguably a historical mistake, it is a common one: examples of tools that suffer from it are GCC, Binutils, GHC and Autoconf.
            Nixpkgs tries to avoid sharing in the mistake where possible.
            Still, because the concept of a target platform is so ingrained, it is best to support it as is.
          </para>
        </listitem>
      </varlistentry>
    </variablelist>
    <para>
      The exact schema these fields follow is a bit ill-defined due to a long and convoluted evolution, but this is slowly being cleaned up.
      You can see examples of ones used in practice in <literal>lib.systems.examples</literal>; note how they are not all very consistent.
      For now, here are few fields can count on them containing:
    </para>
    <variablelist>
      <varlistentry>
        <term><varname>system</varname></term>
        <listitem>
          <para>
            This is a two-component shorthand for the platform.
            Examples of this would be "x86_64-darwin" and "i686-linux"; see <literal>lib.systems.doubles</literal> for more.
            This format isn't very standard, but has built-in support in Nix, such as the <varname>builtins.currentSystem</varname> impure string.
          </para>
        </listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>config</varname></term>
        <listitem>
          <para>
            This is a 3- or 4- component shorthand for the platform.
            Examples of this would be "x86_64-unknown-linux-gnu" and "aarch64-apple-darwin14".
            This is a standard format called the "LLVM target triple", as they are pioneered by LLVM and traditionally just used for the <varname>targetPlatform</varname>.
            This format is strictly more informative than the "Nix host double", as the previous format could analogously be termed.
            This needs a better name than <varname>config</varname>!
          </para>
        </listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>parsed</varname></term>
        <listitem>
          <para>
            This is a nix representation of a parsed LLVM target triple with white-listed components.
            This can be specified directly, or actually parsed from the <varname>config</varname>.
            [Technically, only one need be specified and the others can be inferred, though the precision of inference may not be very good.]
            See <literal>lib.systems.parse</literal> for the exact representation.
          </para>
        </listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>libc</varname></term>
        <listitem>
          <para>
            This is a string identifying the standard C library used.
            Valid identifiers include "glibc" for GNU libc, "libSystem" for Darwin's Libsystem, and "uclibc" for µClibc.
            It should probably be refactored to use the module system, like <varname>parse</varname>.
          </para>
        </listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>is*</varname></term>
        <listitem>
          <para>
            These predicates are defined in <literal>lib.systems.inspect</literal>, and slapped on every platform.
            They are superior to the ones in <varname>stdenv</varname> as they force the user to be explicit about which platform they are inspecting.
            Please use these instead of those.
          </para>
        </listitem>
      </varlistentry>
      <varlistentry>
        <term><varname>platform</varname></term>
        <listitem>
          <para>
            This is, quite frankly, a dumping ground of ad-hoc settings (it's an attribute set).
            See <literal>lib.systems.platforms</literal> for examples—there's hopefully one in there that will work verbatim for each platform that is working.
            Please help us triage these flags and give them better homes!
          </para>
        </listitem>
      </varlistentry>
    </variablelist>
  </section>

  <section>
    <title>Specifying Dependencies</title>
    <para>
      In this section we explore the relationship between both runtime and buildtime dependencies and the 3 Autoconf platforms.
    </para>
    <para>
      A runtime dependency between 2 packages implies that between them both the host and target platforms match.
      This is directly implied by the meaning of "host platform" and "runtime dependency":
      The package dependency exists while both packages are running on a single host platform.
    </para>
    <para>
      A build time dependency, however, implies a shift in platforms between the depending package and the depended-on package.
      The meaning of a build time dependency is that to build the depending package we need to be able to run the depended-on's package.
      The depending package's build platform is therefore equal to the depended-on package's host platform.
      Analogously, the depending package's host platform is equal to the depended-on package's target platform.
    </para>
    <para>
      In this manner, given the 3 platforms for one package, we can determine the three platforms for all its transitive dependencies.
      This is the most important guiding principle behind cross-compilation with Nixpkgs, and will be called the <wordasword>sliding window principle</wordasword>.
    </para>
    <para>
      Some examples will probably make this clearer.
      If a package is being built with a