Finalized Python Package #57

jbremer · 2016-06-30T11:04:53Z

This PR includes both PRs from @Titotix and some more love in order to make peepdf a true Python package, with its core library accessible through import peepdf and the peepdf.py command exposed as the global (or local in case of virtualenv) peepdf command.

In addition to that this PR modifies a few minor things:

Normalizes the version number to a format that causes fewer problems with Python packaging (in this case, version 0.3.1). It will have to be increased for every release.
Removes the update functionality; it can be replaced simply by pip install -U peepdf.
Moves the error logging file from $GIT/errors.txt to ~/.peepdf-error.txt.
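
For illustration, the packaging changes described above could look roughly like the following setup.py. This is a minimal sketch, not the file from this PR: the entry point target `peepdf.main:main` is a hypothetical module path, and dependencies are omitted. Only the package name, the version 0.3.1, and the peepdf command name come from the description.

```python
# setup.py (illustrative sketch, not the actual file from this PR)
import sys

SETUP_KWARGS = dict(
    name="peepdf",
    # Normalized version string; bump it for every release.
    version="0.3.1",
    # Makes the core library importable as `import peepdf`.
    packages=["peepdf"],
    entry_points={
        "console_scripts": [
            # Exposes a `peepdf` command (global, or local to a virtualenv).
            # Assumption: the CLI entry function is peepdf/main.py:main.
            "peepdf = peepdf.main:main",
        ],
    },
)

# Only invoke setuptools when a command such as `sdist` is given,
# so this module can also be imported or inspected safely.
if __name__ == "__main__" and len(sys.argv) > 1:
    from setuptools import setup

    setup(**SETUP_KWARGS)
```

With a layout like this, pip install -U peepdf upgrades both the library and the command, which is why a built-in update feature is no longer needed.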

I think that's most of it. Now I'd be very happy if you could merge this PR and upload it to PyPI (python setup.py register && python setup.py sdist upload). I'm also happy to do that for you, but as you're the developer of this library I think it would make most sense if you handle this yourself.

Let me know if there are any questions.

Regards,
Jurriaan

Allow some badchars in the JS code

Due to two characteristics of PDF streams, the new isJavascript() variant became extremely slow. First, we want to allow Javascript with *some* bad characters; e.g., a trademark sign or Unicode apostrophe would otherwise already mark a Javascript stream as not being Javascript. This was identified and fixed by Fernando Dominguez for the 0.3.5 release. Second, as a consequence of that fix, some streams were incorrectly let through. This was caused by "\x00" not being counted as an incorrect character, so the 10-20% quota was not met. Combined, these two issues led to a major performance issue, as whole streams were being processed and included in the output. This commit fixes those performance issues and replaces the Python character counting with a regex query that's slightly faster.
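
The idea of counting bad characters with a regex and comparing against a quota can be sketched as follows. This is a hedged illustration, not the code from this commit: the function names, the exact definition of a "bad" byte (anything outside printable ASCII plus tab, LF and CR, so "\x00" counts as bad), and the fixed 20% threshold are all assumptions.

```python
import re

# Assumption: a "bad" byte is anything that is not printable ASCII,
# tab, line feed or carriage return. "\x00" therefore counts as bad.
BAD_CHARS = re.compile(rb"[^\t\n\r\x20-\x7e]")

# Assumption: upper end of the 10-20% quota mentioned above.
MAX_BAD_RATIO = 0.2


def bad_char_ratio(data):
    """Return the fraction of bytes in `data` that are considered bad."""
    if not data:
        return 0.0
    return len(BAD_CHARS.findall(data)) / len(data)


def passes_bad_char_quota(data, max_ratio=MAX_BAD_RATIO):
    """Cheap pre-filter: True if `data` is clean enough to inspect further.

    This does not decide whether the stream is Javascript; it only
    rejects streams that are mostly binary before doing slower checks.
    """
    return bad_char_ratio(data) <= max_ratio
```

A script containing a single trademark sign stays well under the quota, while a stream padded with NUL bytes is rejected early:

```python
script = "var name = 'Acme\u2122'; app.alert(name);".encode("utf-8")
print(passes_bad_char_quota(script))
print(passes_bad_char_quota(b"\x00" * 90 + b"var a = 1;"))
```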

Fixes a serious performance bottleneck. More information may be found at the following commits by Brad Spengler [1][2][3].

[1]: spender-sandbox/cuckoo-modified@1d3bdee
[2]: spender-sandbox/cuckoo-modified@d1a44d8
[3]: spender-sandbox/cuckoo-modified@7e2f521

jpic · 2019-10-30T22:34:57Z

Amazing, thanks for maintaining the library on PyPI! 🎩

Titotix and others added 30 commits February 17, 2016 13:20

introduce setup.py

156d618

Change project structure

fe3efc4

Create subrepo peepdf which contains all src code. This in order to create a full module peepdf with setuptools in setup.py

setup.py: module in lower case

32dff13

import jsbeautify as external lib

76b6600

import colorama as external lib

362889a

aespython as external lib

adc512c

Improved Javascript detection in function isJavascript

1470253

Added updateStats call in PDFBody.updateObjects to take into account the new stats after resolving objects

18fe76b

Added in the output the total number of objects per suspicious element

8225f47

Added in the info output the total number of objects per suspicious element

9fb0872

Some PEP8 cleaning

36227a8

Added element to mark an object as containingJS even if the content is not detected as JS BUT is referenced from a /JS element. Modified the PDFFile.decrypt function to update the global stats before decrypting the objects.

0fa6409

Revision changed

646c856

Added new variable and getter to keep track of which object contains which JS code/URI

8fb9cb6

Modified PDFBody.getJavascriptCode to be able to return the js code per object too

be9cfa6

Modified PDFFile.getURIs to be able to return the uris per object too

2713210

Modified do_extract to show information about objects containing URIs and JS code

18048f9

Modified PDFDictionary.update to force not detected JS code to be always visible

2709c48

Modified PDFFile.getJavascriptCode and PDFFile.getURIs to give an output per version, so we don't lose that information

b9811df

Add version information in the extract command output

39d59f4

Added TODO line and revision change

408b770

Added a direct string argument to several commands: decode, encode, js_analyse, js_eval, js_unescape, etc

f07430b

Fixed some variable reference errors

ce2299d

Revision change

f75d77a

Added error handling in computeUserPass if revision < 2

944cfb7

PEP8 changes in PDFCrypto

a82f8b8

Added custom function setReferencedJSObject in PDFStream to avoid changing the stream object when we call setReferencedJSObject (it was giving bad results with encryption)

2bb9e39

Fixed issue jesparza#36 (bad redirection output in filter command). Thanks to KurtPfeifle for the feedback ;)

c403dcd

Added new argument -C to execute commands from the interactive console from the command line (really basic implementation)

006dd9c

Updated copyright years

6efd14c

Fernando and others added 7 commits June 9, 2017 16:09

Allow up to 20% badchars

2d436eb

Add a JS detection test

afa849f

Remove unused vars

8a365f4

Merge pull request #5 from FernandoDoming/patch-1

f944bcb

Allow some badchars in the JS code

version 0.3.5

6915a70

version 0.3.6

80748f5

jbremer force-pushed the master branch from c05d61b to 80748f5 Compare June 10, 2017 21:39

Jurriaan Bremer and others added 16 commits December 12, 2017 16:01

get rid of superfluous global statements

73ab841

version 0.3.7

12c7b19

Passing all tests

8cc27b6

Making a little progress with py3 compatibility

a456f52

Progress...

1899e90

Wohoo all 4 tests succeeding at sflock

66b5d36

All tests on peepdf and sflock/test_pdf working :D

bb03df1

code cleanup

139da0a

code cleanup

9833d27

version 0.4.0

2002565

install future for py2/3 compatibility

6aa9692

version 0.4.1 with future included

a63d628

py35/py36/macos unit testing

8d6a370

fix object not found issue #7 (thanks Wyatt Roersma)

f476b4d

ignore presumable ghostscript streams for jsbeautifier

e727ac7

jbremer force-pushed the master branch from 2b1c3d7 to e727ac7 Compare June 11, 2018 21:11

version 0.4.2

11dab2e

jbremer deleted the branch jesparza:master December 8, 2021 10:24

jbremer closed this Dec 8, 2021

jbremer deleted the master branch December 8, 2021 10:24
