The ProteinFileReader DLL can be used to open a FASTA file or delimited text file with protein names and sequences and return each protein present.
- It can read both plain text files and gzipped files, for example E_coli_K12_UniProt.fasta.gz
- Supports both tab-delimited and comma separated value files, including .csv files with double quoted strings with internal commas
ProteinFileReader.Dll is available on NuGet at:
The reader depends on NuGet package CsvHelper
The latest versions of the DLL is available for six months on the AppVeyor CI server
Source code is available on GitHub at:
Written by Matthew Monroe for the Department of Energy (PNNL, Richland, WA)
E-mail: matthew.monroe@pnnl.gov or proteomics@pnnl.gov
Website: https://github.com/PNNL-Comp-Mass-Spec/ or https://www.pnnl.gov/integrative-omics
Licensed under the Apache License, Version 2.0; you may not use this program except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0