-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
taxpasta: TAXonomic Profile Aggregation and STAndardisation #84
Comments
hey there 👋 @Midnighter welcome to pyopensci and thank you for this submission! i just wanted to say hello so you know that we see this submission and will be getting back to you shortly! i also am looking into forms for the submission template as well - many thanks for that suggestion! More soon! |
Welcome @Midnighter @sofstam @jfy133! Glad to see this as someone who develops a similar tool (for a different domain!). Looks like most everything is there. I do want to request one thing before we proceed with a review though: Something like the "Examples" section in the Pynteny docs here (which also has a CLI interface): https://robaina.github.io/Pynteny/examples/example_cli/ (These docs were in place before we began the Pynteny review, which is why I'm providing them as an example of what we need to see before we start.) I would suggest a sort of walkthrough of the main use case for taxpasta. This might be a simplified version of something you're using it for in research already. I have provided some other feedback below but adding an initial tutorial is the only thing that's necessary at this time. Editor in Chief checksHi there! Thank you for submitting your package for pyOpenSci Please check our Python packaging guide for more information on the elements below.
Editor commentsThis is not required in checks but I would suggest adding example data to taxpasta. For my own library I just add the files directly to the package and then access with |
Also @sofstam @jfy133 could you both please fill out the pre-review survey? |
Hi @NickleDave, Thank you for your review and comments. Adding a tutorial and improving the docs mostly makes sense to me. I will work on adding those. The tables are indeed small, still I'm a little hesitant to distribute them with the package. However, pandas should be able to just load a table from a URL so that's maybe a way to go.
|
oops, sorry - from the second page most of the questions seemed to be about the package and I assume Moritz had already provided all that info. I've filled it out, but left pretty much all the optional questions empty as either you have that info from Moritz, and/or submitting via pyOpenSci is Moritz' initative so I'm not that familiar/involved with python etc. One comment though: I noticed that the options in your 'What background best describes you cultural identity?' question is extremely N. American focused. Unfortunately I don't have a good solution for you, but you may not be getting a particularly good overview of this - e.g. Asian spans half the world with many equally large sub-divisons as the Pacific Islanders, and also grouping 'Black' and African-American is also arguably unfair as they also can have a large difference in backgrounds/problems etc. For example see what official surveys from UK use: https://www.ethnicity-facts-figures.service.gov.uk/style-guide/ethnic-groups |
@NickleDave regarding your comments on |
Great, thank you @Midnighter. Your other comments re: tables, bioconda, Zenodo, and the truncation all sound good. Would it be worth making rich a dependency so you don't have to special case typer behavior?
Thank you @jfy133 I hear you and I will relay this to our executive director @lwasser. Your feedback is helpful; this is definitely a version 0.1 of the survey and I'm sure we can improve it to not be US-centric, and to encompass people who are core contributors to a project even if they are not primarily Python developers. |
At the moment, it's an extra dependency that can be installed with |
Understood! Btw @Midnighter could I ask you to link this review issue on any issues you raise to make changes before review? By adding a link to the issue on the taxpasta repo, so that GitHub cross references them. Just to help us track. Thank you 🙏 |
Dear @NickleDave, We think that we've implemented everything that you've noticed. Please take another look and let us know your thoughts.
|
Hi @Midnighter I did check out these additions -- looks great! Thank you for addressing all of those comments. |
Hi again @Midnighter very happy to say that @ctb has very kindly agreed to take on the editor role for this review. Thanks for your patience while we found someone who knows your domain and the software community around it. I will let @ctb take it from here and start tagging in reviewers. |
I guess we'll have to add support for sourmash then 😆 Good to hear and I look forward to your review @ctb. Thank you for your time. |
welcome! ironically we have invested time in generating the same report formats as you, for other consumers of taxonomy, so I think we are well positioned to engage productively ;). |
hey y'all just wanted to note that i'm working on the survey questions being a LOT more inclusive. Many thanksf or your patience there and THANK YOU for calling this to our attention! It's very much american-centric |
(finally got around to asking second reviewer; response soon :)) |
Editor comments👋 Hi @bluegenes and @snacktavish! Thank you for volunteering to review The following resources will help you complete your review:
Please get in touch with any questions or concerns! Your review is due: April 21st, 2023.Reviewers: @bluegenes @snacktavish |
ok, trying tagging in @bluegenes and @snacktavish again. Or do we need to invite them to this repo also? |
FYI, we just released version 0.4.0 that wraps the changes from the reviews, as well as further issues discovered by users. |
ok, here goes: 🎉 taxpasta has been approved by pyOpenSci! Thank you @Midnighter for submitting and many thanks to @snacktavish @bluegenes for reviewing this package! 😸 Author Wrap Up TasksThere are a few things left to do to wrap up this submission:
It looks like you would like to submit this package to JOSS. Here are the next steps:
🎉 Congratulations! You are now published with both JOSS and pyOpenSci! 🎉 Editor Final ChecksPlease complete the final steps to wrap up this review. Editor, please do the following:
If you have any feedback for us about the review process please feel free to share it here. We are always looking to improve our process and documentation in the peer-review-guide. |
Added badge and confirm I've filled out the post-review survey! |
Yes, it has passed review! At least one reviewer has checked all the criteria. |
@ctb thanks for the reply. Note though that @NickleDave |
there are so many check boxes 😆 but also, I think:
could maybe best happen after JOSS paper is available. Similarly,
I felt needed to wait until there was a JOSS issue. In addition, I don't think the authors have the ability to check off those boxes so it had to wait until THEY did the work and then I noticed that they had done so => check the boxes. For the other checklist items like adding badges and so on, those were done after I signed off on the review and I don't get notified of PRs etc. referencing this issue so barring monitoring this issue manually or being tagged in on the PRs, I'm not sure how I would go about knowing I should check them off... (I don't mean to be defensive - I am just explaining where we are and why, and confirming to myself that I had read the unchecked boxes and made the decision not to check them off yet, instead of missing them entirely, which was also certainly a possibility!) |
Correct, at least I was unable to tick of the boxes of what was meant to be our tasks (as authors) |
ahhhh ok lessons learned here that we need a way for authors to check their boxes AND/OR we need fewer of them . thank you all for the feedback!!! i hadn't considered that authors would NOT be able to check those author list boxes - we will have to come up with a better approach! i'm open to suggestions. and it looks like taxpasta was accepted by JOSS - hooray. 🎉 🎉 |
🎉 ! Thanks @lwasser ! |
awesome! ok i'll close this issue now. Friends - if anyone from this review (reviewers, maintainers etc) is interested in joining our slack community please email me at leah at pyopensci.org or reply here if your email is available on your github and i can add you!! we'd love to have all of you there! |
@lwasser One of the JOSS AEiCs here. Thanks for the great work with pyOpenSci. |
Submitting Author: Moritz E. Beber (@Midnighter)
All current maintainers: (@Midnighter, @sofstam, @jfy133)
Package Name: taxpasta
One-Line Description of Package: TAXonomic Profile Aggregation and STAndardisation
Repository Link: https://github.com/taxprofiler/taxpasta
Version submitted: 0.2.1
Editor: @ctb
Reviewer 1: @snacktavish
Reviewer 2: @bluegenes
Archive: https://github.com/taxprofiler/taxpasta/releases/tag/0.4.0
JOSS DOI:
Version accepted: 0.4.0
Date accepted (month/day/year): 07/05/2023
Code of Conduct & Commitment to Maintain Package
Description
The main purpose of taxpasta is to standardise taxonomic profiles created by a
range of bioinformatics tools. We call those tools taxonomic profilers. They
each come with their own particular, tabular output format. Across the profilers,
relative abundances can be reported in read counts, fractions, or percentages,
as well as any number of additional columns with extra information. We therefore
decided to take the lessons learnt to heart and provide
our own solution to deal with this pasticcio. With taxpasta you can ingest all
of those formats and, at a minimum, output taxonomy identifiers and their
integer counts.
Taxpasta can not only standardise profiles but also merge them across samples
for the same profiler into a single table. In future, we also intend to offer
methods for forming a consensus for the same sample analyzed by different
profilers.
Scope
Please indicate which category or categories.
Check out our package scope page to learn more about our
scope. (If you are unsure of which category you fit, we suggest you make a pre-submission inquiry):
Domain Specific & Community Partnerships
Community Partnerships
If your package is associated with an
existing community please check below:
For all submissions, explain how the and why the package falls under the categories you indicated above. In your explanation, please address the following points (briefly, 1-2 sentences for each):
Who is the target audience and what are scientific applications of this package?
Taxpasta is a tool for anyone working with taxonomic profiles from metagenomic sequencing experiments. Mostly that means ecologists, bioinformaticians, statisticians. Taxpasta's main application is to standardise profiles from a range of different tools. Having a singular format facilitates downstream analyses. Taxpasta is used, for example, in the upcoming taxprofiler pipeline implemented in nextflow. There, it also serves to combine the profiles of many samples into a single file.
Are there other Python packages that accomplish the same thing? If so, how does yours differ?
The BIOM format was created with the intention of standardizing a storage format for microbiome analyses. However, creating this format was entirely left to the user. Taxpasta conveniently knows how to read profiles from a range of tools and can also produce BIOM output.
Some of the taxonomic profilers also come with scripts to convert their output into another format but none of them support such a wide range of tools as taxpasta does.
If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or
@tag
the editor you contacted:Technical checks
For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:
CONTRIBUTING.rst
).Publication Options
JOSS Checks
paper.md
matching JOSS's requirements with a high-level description in the package root or ininst/
.Note: JOSS accepts our review as theirs. You will NOT need to go through another full review. JOSS will only review your paper.md file. Be sure to link to this pyOpenSci issue when a JOSS issue is opened for your package. Also be sure to tell the JOSS editor that this is a pyOpenSci reviewed package once you reach this step.
Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?
This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.
Confirm each of the following by checking the box.
Please fill out our survey
submission and improve our peer review process. We will also ask our reviewers
and editors to fill this out.
P.S. *Have feedback/comments about our review process? Leave a comment here
Editor and Review Templates
The [editor template can be found here][Editor Template].
The [review template can be found here][Review Template].
The text was updated successfully, but these errors were encountered: