Skip to content

Class for reading Protein files, specifically .fasta files and certain tab-delimited formats

Notifications You must be signed in to change notification settings

PNNL-Comp-Mass-Spec/Protein-File-Reader

Repository files navigation

The ProteinFileReader DLL can be used to open a FASTA file or delimited text file with protein names and sequences and return each protein present.

  • It can read both plain text files and gzipped files, for example E_coli_K12_UniProt.fasta.gz
  • Supports both tab-delimited and comma separated value files, including .csv files with double quoted strings with internal commas

NuGet

ProteinFileReader.Dll is available on NuGet at:

The reader depends on NuGet package CsvHelper

Continuous Integration

The latest versions of the DLL is available for six months on the AppVeyor CI server

Build status

Source Code

Source code is available on GitHub at:

Contacts

Written by Matthew Monroe for the Department of Energy (PNNL, Richland, WA)
E-mail: matthew.monroe@pnnl.gov or proteomics@pnnl.gov
Website: https://github.com/PNNL-Comp-Mass-Spec/ or https://www.pnnl.gov/integrative-omics

License

Licensed under the Apache License, Version 2.0; you may not use this program except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0