-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Document input description and metadata output. #645
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,144 @@ | ||
| .. index:: ! commandline compiler, compiler;commandline, ! solc, ! linker | ||
|
|
||
| .. _commandline-compiler: | ||
|
|
||
| ****************************** | ||
| Using the Commandline Compiler | ||
| ****************************** | ||
|
|
||
| One of the build targets of the Solidity repository is ``solc``, the solidity commandline compiler. | ||
| Using ``solc --help`` provides you with an explanation of all options. The compiler can produce various outputs, ranging from simple binaries and assembly over an abstract syntax tree (parse tree) to estimations of gas usage. | ||
| If you only want to compile a single file, you run it as ``solc --bin sourceFile.sol`` and it will print the binary. Before you deploy your contract, activate the optimizer while compiling using ``solc --optimize --bin sourceFile.sol``. If you want to get some of the more advanced output variants of ``solc``, it is probably better to tell it to output everything to separate files using ``solc -o outputDirectory --bin --ast --asm sourceFile.sol``. | ||
|
|
||
| The commandline compiler will automatically read imported files from the filesystem, but | ||
| it is also possible to provide path redirects using ``prefix=path`` in the following way: | ||
|
|
||
| :: | ||
|
|
||
| solc github.com/ethereum/dapp-bin/=/usr/local/lib/dapp-bin/ =/usr/local/lib/fallback file.sol | ||
|
|
||
| This essentially instructs the compiler to search for anything starting with | ||
| ``github.com/ethereum/dapp-bin/`` under ``/usr/local/lib/dapp-bin`` and if it does not | ||
| find the file there, it will look at ``/usr/local/lib/fallback`` (the empty prefix | ||
| always matches). ``solc`` will not read files from the filesystem that lie outside of | ||
| the remapping targets and outside of the directories where explicitly specified source | ||
| files reside, so things like ``import "/etc/passwd";`` only work if you add ``=/`` as a remapping. | ||
|
|
||
| If there are multiple matches due to remappings, the one with the longest common prefix is selected. | ||
|
|
||
| If your contracts use :ref:`libraries <libraries>`, you will notice that the bytecode contains substrings of the form ``__LibraryName______``. You can use ``solc`` as a linker meaning that it will insert the library addresses for you at those points: | ||
|
|
||
| Either add ``--libraries "Math:0x12345678901234567890 Heap:0xabcdef0123456"`` to your command to provide an address for each library or store the string in a file (one library per line) and run ``solc`` using ``--libraries fileName``. | ||
|
|
||
| If ``solc`` is called with the option ``--link``, all input files are interpreted to be unlinked binaries (hex-encoded) in the ``__LibraryName____``-format given above and are linked in-place (if the input is read from stdin, it is written to stdout). All options except ``--libraries`` are ignored (including ``-o``) in this case. | ||
|
|
||
|
|
||
| ************************************************** | ||
| Standardized Input Description and Metadata Output | ||
| ************************************************** | ||
|
|
||
| In order to ease source code verification of complex contracts that are spread across several files, | ||
| there is a standardized way for describing the relations between those files. | ||
| Furthermore, the compiler can generate a json file while compiling that includes | ||
| the source, natspec comments and other metadata whose hash is included in the | ||
| actual bytecode. Specifically, the creation data for a contract has to begin with | ||
| `push32 <metadata hash> pop`. | ||
|
|
||
| There is some overlap between the input description and the metadata output | ||
| and due to the fact that some fields are optional, the metadata can be used as | ||
| input to the compiler. In order to verify the metadata, you actually take it, | ||
| re-run the compiler on the metadata and check that it again produces the same | ||
| metadata. | ||
|
|
||
| If the compiler is invoked in a different way, not using the input | ||
| description (for example by using a file content retrieval callback), | ||
| the compiler can still generate the metadata alongside the bytecode of each | ||
| contract. | ||
|
|
||
| The metadata standard is versioned. Future versions are only required to provide the "version" field, | ||
| the "language" field and the two keys inside the "compiler" field. | ||
| The field compiler.keccak should be the keccak hash of a binary of the compiler with the given version. | ||
|
|
||
| The example below is presented in a human-readable way. Properly formatted metadata | ||
| should use quotes correctly, reduce whitespace to a minimum and sort the keys of all objects | ||
| to arrive at a unique formatting. | ||
|
|
||
| Comments are of course not permitted and used here only for explanatory purposes. | ||
|
|
||
| Input Description | ||
| ----------------- | ||
|
|
||
| The input description is language-specific and could change with each compiler version, but it | ||
| should be backwards compatible if possible. | ||
|
|
||
| { | ||
| sources: | ||
| { | ||
| "abc": "contract b{}", | ||
| "def": {keccak: "0x123..."}, // source has to be retrieved by its hash | ||
| "dir/file.sol": "contract a {}" | ||
| }, | ||
| settings: | ||
| { | ||
| remappings: [":g/dir"], | ||
| optimizer: {enabled: true, runs: 500}, | ||
| compilationTarget: "myFile.sol:MyContract", // Can also be an array | ||
| // To be backwards compatible, use the full name of the contract in the output | ||
| // only if there are name clashes. | ||
| // If the full name was given as "compilationTargets", use the full name. | ||
| libraries: { | ||
| "def:MyLib": "0x123123..." | ||
| }, | ||
| // The following can be used to restrict the fields the compiler will output. | ||
| outputSelection: { | ||
| // to be defined | ||
| } | ||
| } | ||
| } | ||
|
|
||
| Metadata Output | ||
| --------------- | ||
|
|
||
| Note that the actual bytecode is not part of the metadata because the hash | ||
| of the metadata structure will be included in the bytecode itself. | ||
|
|
||
| This requires the compiler to be able to compute the hash of its own binary, | ||
| which requires it to be statically linked. The hash of the binary is not | ||
| too important. It is much more important to have the commit hash because | ||
| that can be used to query a location of the binary (and whether the version is | ||
| "official") at a registry contract. | ||
|
|
||
| { | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. dont forget the language. (language version maybe even?)
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I actually did not think of this as a generic thing, but it might very well be :-) |
||
| version: "1", | ||
| language: "Solidity", | ||
| compiler: { | ||
| commit: "55db20e32c97098d13230ab7500758e8e3b31d64", | ||
| version: "soljson-2313-2016-12-12", | ||
| keccak: "0x123..." | ||
| }, | ||
| sources: | ||
| { | ||
| "abc": "contract b{}", | ||
| "def": {keccak: "0x123..."}, // source has to be retrieved by its hash | ||
| "dir/file.sol": "contract a {}" | ||
| }, | ||
| settings: | ||
| { | ||
| remappings: [":g/dir"], | ||
| optimizer: {enabled: true, runs: 500}, | ||
| compilationTarget: "myFile.sol:MyContract", | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. now this should really not be here
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Why do you think so? If there are multiple contracts (or libraries) in the input, we have to designate which contract we are referring to. |
||
| // To be backwards compatible, use the full name of the contract in the output | ||
| // only if there are name clashes. | ||
| // If the full name was given as "compilationTargets", use the full name. | ||
| libraries: { | ||
| "def:MyLib": "0x123123..." | ||
| } | ||
| }, | ||
| output: | ||
| { | ||
| abi: [ /* abi definition */ ], | ||
| userDocumentation: [ /* user documentation comments */ ], | ||
| developerDocumentation: [ /* developer documentation comments */ ], | ||
| natspec: [ /* natspec comments */ ] | ||
| } | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Proposed names for the rest:
I'm not fully sure about source maps?
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ignore the
Contributor
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I am actually not sure how much to include here. As the metadata output is mainly used for source verification, you can anyway re-generate everything here by re-running the compiler. I would only include the essential information here which are needed even in the absence of a compiler like the ones given above. The general output format of the compiler should be a superset of the metadata output or at least largely overlapping. This general output can contain exactly the fields you mention.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. To be honest I mistook this PR for the compiler input/output and only realised after that it is for verification purposes. Should we open a issue / PR / gist for the compiler input/output? |
||
| } | ||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
like!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As mentioned in the Gist, I think this should have a (optional)
type/kind/networkfield to signal where to look for it, i.e."swarm","ipfs".