Open
Milestone
Issues en vrac
Unstarted Issues (open and unassigned)
20
- Améliorations possibles du pipeline
- Improve HiFi binning
- tester https://github.com/GaetanBenoitDev/metaMDBG pour assembler les reads HiFi
- Improvment siggested by Maria
- binning : benschmark of other tools / methods
- check if assembly filter is working well when co-assembly is configured and add the choice to not filter on cpm but in contig length
- provide the possibility to give the metaflye infos of circular contigs as input at the same time as the assembly
- Duplicate clustering if coassembly all samples
- Refactoring
- Add info if bin is RNA complete or not
- Minimap2 requires an extra parameter for large reference (>4GB)
- CDHIT on aa protein sequences rather than on nucleotides protein sequences
- Re-activate the workflow installation with conda
- Add taxonomic affiliation plots to multiqc
- Eggnog mapper chunks
- Questions to answer with the pipeline (biologist)
- Bins and contigs: generate matrix with metrics
- Generate .html to make documentation website
- Clustering: improve table_clstr.txt
- Replace diamond by MMseq2? (benchmark)
Ongoing Issues (open and assigned)
8
- improve checkM2 database managing
- Make binning results compatible with anvio
- Functional test: check process and image, nextflow stub and gitlab CI/CD
- Databases saving folder
- Functional tests: change output to table format
- Documentation: update use_case.md with interpretations
- 03_filtering: Assess the impact of CPM and contig length filtering
- Metrics on final quantification file with annotations
Completed Issues (closed)
18
- La création de l'environnement conda metagWGS ne marche pas sur genotoul.
- Singularity: find a solution to have a relative env folder usage
- busco DB link initialization error (use --skip_busco)
- Fix column names for single sample
- Simplify software version scrapping
- Check the consistency between the results files
- check and correct differences in (assembly) results for functional tests
- Understanding why RGI (with prodigal) predicts more genes than metagWGS (with prokka)
- Sample names in MultiQC report
- Simplifier les paramètres de taxonomy
- Multiple singularity files (one by process)
- Usage of Markdown for use case output integration
- "split" Step 05_alignment
- Verify DIAMOND best hit strategy
- Add taxonomy db parameters
- Taxonomic affiliation of contigs and bins exploration
- nextflow.config file: manage memory and CPUs
- Transformation of prokka gff file in gtf file to take into account introns
Loading
Loading
Loading