Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions docs/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -92,6 +92,7 @@ Contents
installing-solidity.rst
solidity-by-example.rst
solidity-in-depth.rst
using-the-compiler.rst
style-guide.rst
common-patterns.rst
frequently-asked-questions.rst
34 changes: 0 additions & 34 deletions docs/miscellaneous.rst
Original file line number Diff line number Diff line change
Expand Up @@ -95,40 +95,6 @@ is simplified to code which can also be compiled from

even though the instructions contained a jump in the beginning.

.. index:: ! commandline compiler, compiler;commandline, ! solc, ! linker

.. _commandline-compiler:

******************************
Using the Commandline Compiler
******************************

One of the build targets of the Solidity repository is ``solc``, the solidity commandline compiler.
Using ``solc --help`` provides you with an explanation of all options. The compiler can produce various outputs, ranging from simple binaries and assembly over an abstract syntax tree (parse tree) to estimations of gas usage.
If you only want to compile a single file, you run it as ``solc --bin sourceFile.sol`` and it will print the binary. Before you deploy your contract, activate the optimizer while compiling using ``solc --optimize --bin sourceFile.sol``. If you want to get some of the more advanced output variants of ``solc``, it is probably better to tell it to output everything to separate files using ``solc -o outputDirectory --bin --ast --asm sourceFile.sol``.

The commandline compiler will automatically read imported files from the filesystem, but
it is also possible to provide path redirects using ``prefix=path`` in the following way:

::

solc github.com/ethereum/dapp-bin/=/usr/local/lib/dapp-bin/ =/usr/local/lib/fallback file.sol

This essentially instructs the compiler to search for anything starting with
``github.com/ethereum/dapp-bin/`` under ``/usr/local/lib/dapp-bin`` and if it does not
find the file there, it will look at ``/usr/local/lib/fallback`` (the empty prefix
always matches). ``solc`` will not read files from the filesystem that lie outside of
the remapping targets and outside of the directories where explicitly specified source
files reside, so things like ``import "/etc/passwd";`` only work if you add ``=/`` as a remapping.

If there are multiple matches due to remappings, the one with the longest common prefix is selected.

If your contracts use :ref:`libraries <libraries>`, you will notice that the bytecode contains substrings of the form ``__LibraryName______``. You can use ``solc`` as a linker meaning that it will insert the library addresses for you at those points:

Either add ``--libraries "Math:0x12345678901234567890 Heap:0xabcdef0123456"`` to your command to provide an address for each library or store the string in a file (one library per line) and run ``solc`` using ``--libraries fileName``.

If ``solc`` is called with the option ``--link``, all input files are interpreted to be unlinked binaries (hex-encoded) in the ``__LibraryName____``-format given above and are linked in-place (if the input is read from stdin, it is written to stdout). All options except ``--libraries`` are ignored (including ``-o``) in this case.

***************
Tips and Tricks
***************
Expand Down
144 changes: 144 additions & 0 deletions docs/using-the-compiler.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,144 @@
.. index:: ! commandline compiler, compiler;commandline, ! solc, ! linker

.. _commandline-compiler:

******************************
Using the Commandline Compiler
******************************

One of the build targets of the Solidity repository is ``solc``, the solidity commandline compiler.
Using ``solc --help`` provides you with an explanation of all options. The compiler can produce various outputs, ranging from simple binaries and assembly over an abstract syntax tree (parse tree) to estimations of gas usage.
If you only want to compile a single file, you run it as ``solc --bin sourceFile.sol`` and it will print the binary. Before you deploy your contract, activate the optimizer while compiling using ``solc --optimize --bin sourceFile.sol``. If you want to get some of the more advanced output variants of ``solc``, it is probably better to tell it to output everything to separate files using ``solc -o outputDirectory --bin --ast --asm sourceFile.sol``.

The commandline compiler will automatically read imported files from the filesystem, but
it is also possible to provide path redirects using ``prefix=path`` in the following way:

::

solc github.com/ethereum/dapp-bin/=/usr/local/lib/dapp-bin/ =/usr/local/lib/fallback file.sol

This essentially instructs the compiler to search for anything starting with
``github.com/ethereum/dapp-bin/`` under ``/usr/local/lib/dapp-bin`` and if it does not
find the file there, it will look at ``/usr/local/lib/fallback`` (the empty prefix
always matches). ``solc`` will not read files from the filesystem that lie outside of
the remapping targets and outside of the directories where explicitly specified source
files reside, so things like ``import "/etc/passwd";`` only work if you add ``=/`` as a remapping.

If there are multiple matches due to remappings, the one with the longest common prefix is selected.

If your contracts use :ref:`libraries <libraries>`, you will notice that the bytecode contains substrings of the form ``__LibraryName______``. You can use ``solc`` as a linker meaning that it will insert the library addresses for you at those points:

Either add ``--libraries "Math:0x12345678901234567890 Heap:0xabcdef0123456"`` to your command to provide an address for each library or store the string in a file (one library per line) and run ``solc`` using ``--libraries fileName``.

If ``solc`` is called with the option ``--link``, all input files are interpreted to be unlinked binaries (hex-encoded) in the ``__LibraryName____``-format given above and are linked in-place (if the input is read from stdin, it is written to stdout). All options except ``--libraries`` are ignored (including ``-o``) in this case.


**************************************************
Standardized Input Description and Metadata Output
**************************************************

In order to ease source code verification of complex contracts that are spread across several files,
there is a standardized way for describing the relations between those files.
Furthermore, the compiler can generate a json file while compiling that includes
the source, natspec comments and other metadata whose hash is included in the
actual bytecode. Specifically, the creation data for a contract has to begin with
`push32 <metadata hash> pop`.

There is some overlap between the input description and the metadata output
and due to the fact that some fields are optional, the metadata can be used as
input to the compiler. In order to verify the metadata, you actually take it,
re-run the compiler on the metadata and check that it again produces the same
metadata.

If the compiler is invoked in a different way, not using the input
description (for example by using a file content retrieval callback),
the compiler can still generate the metadata alongside the bytecode of each
contract.

The metadata standard is versioned. Future versions are only required to provide the "version" field,
the "language" field and the two keys inside the "compiler" field.
The field compiler.keccak should be the keccak hash of a binary of the compiler with the given version.

The example below is presented in a human-readable way. Properly formatted metadata
should use quotes correctly, reduce whitespace to a minimum and sort the keys of all objects
to arrive at a unique formatting.

Comments are of course not permitted and used here only for explanatory purposes.

Input Description
-----------------

The input description is language-specific and could change with each compiler version, but it
should be backwards compatible if possible.

{
sources:
{
"abc": "contract b{}",
"def": {keccak: "0x123..."}, // source has to be retrieved by its hash
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

like!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As mentioned in the Gist, I think this should have a (optional) type/kind/network field to signal where to look for it, i.e. "swarm", "ipfs".

"dir/file.sol": "contract a {}"
},
settings:
{
remappings: [":g/dir"],
optimizer: {enabled: true, runs: 500},
compilationTarget: "myFile.sol:MyContract", // Can also be an array
// To be backwards compatible, use the full name of the contract in the output
// only if there are name clashes.
// If the full name was given as "compilationTargets", use the full name.
libraries: {
"def:MyLib": "0x123123..."
},
// The following can be used to restrict the fields the compiler will output.
outputSelection: {
// to be defined
}
}
}

Metadata Output
---------------

Note that the actual bytecode is not part of the metadata because the hash
of the metadata structure will be included in the bytecode itself.

This requires the compiler to be able to compute the hash of its own binary,
which requires it to be statically linked. The hash of the binary is not
too important. It is much more important to have the commit hash because
that can be used to query a location of the binary (and whether the version is
"official") at a registry contract.

{
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

dont forget the language. (language version maybe even?)
I know in principle the binary's hash identifies if it is serpent or not, but it should allow other high level languages but we dont need to be very frugal here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I actually did not think of this as a generic thing, but it might very well be :-)
Added language field.

version: "1",
language: "Solidity",
compiler: {
commit: "55db20e32c97098d13230ab7500758e8e3b31d64",
version: "soljson-2313-2016-12-12",
keccak: "0x123..."
},
sources:
{
"abc": "contract b{}",
"def": {keccak: "0x123..."}, // source has to be retrieved by its hash
"dir/file.sol": "contract a {}"
},
settings:
{
remappings: [":g/dir"],
optimizer: {enabled: true, runs: 500},
compilationTarget: "myFile.sol:MyContract",
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

now this should really not be here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do you think so? If there are multiple contracts (or libraries) in the input, we have to designate which contract we are referring to.

// To be backwards compatible, use the full name of the contract in the output
// only if there are name clashes.
// If the full name was given as "compilationTargets", use the full name.
libraries: {
"def:MyLib": "0x123123..."
}
},
output:
{
abi: [ /* abi definition */ ],
userDocumentation: [ /* user documentation comments */ ],
developerDocumentation: [ /* developer documentation comments */ ],
natspec: [ /* natspec comments */ ]
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Proposed names for the rest:

  • abi
  • methodIdentifiers / functionHashes
  • evmAssembly
  • evmBytecode
  • evmRuntimeBytecode
  • evmGasEstimates
  • evmOpcodes
  • interface or solidityInterface
  • developerDocumentation
  • userDocumenation
  • natspec
  • why3

I'm not fully sure about source maps?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ignore the bytecode section as it cannot be included per your comment above.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am actually not sure how much to include here. As the metadata output is mainly used for source verification, you can anyway re-generate everything here by re-running the compiler.

I would only include the essential information here which are needed even in the absence of a compiler like the ones given above.

The general output format of the compiler should be a superset of the metadata output or at least largely overlapping. This general output can contain exactly the fields you mention.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To be honest I mistook this PR for the compiler input/output and only realised after that it is for verification purposes.

Should we open a issue / PR / gist for the compiler input/output?

}