Probabilistic Assembly with Mycelia
Mycelia's probabilistic assembly approach combines probabilistic modelling with biological insight to deliver assemblies with confidence intervals.
What is Probabilistic Assembly?
Traditional Assembly
Traditional assemblers make deterministic decisions at each step:
- Fixed k-mer size throughout assembly
- Binary decisions: keep or discard sequences
- Quality scores often ignored after initial filtering
- Heuristic-based error correction
Probabilistic Assembly
Mycelia's probabilistic approach treats assembly as a statistical inference problem:
- Dynamic k-mer progression based on data characteristics
- Probabilistic path selection using quality scores
- Maximum likelihood error correction
- Self-optimizing parameters through machine learning
Traditional: Reads → Graph → Heuristic Cleaning → Assembly
Probabilistic: Reads → Quality-Aware Graph → Statistical Inference → Optimal AssemblyWhy Use Probabilistic Assembly?
1. High Accuracy
- Preserves quality information throughout assembly
- Statistically principled error correction
2. Self-Optimizing
- No manual parameter tuning required
- Automatically selects optimal k-mer sizes
- Adapts to your data's characteristics
3. Quality-Aware
- Implements per-base quality score preservation
- Handles varying coverage gracefully
Choose Your Path
By Data Type
graph TD
A[What type of data do you have?] --> B[FASTA only<br/>No quality scores]
A --> C[FASTQ<br/>With quality scores]
A --> D[Mixed<br/>Short + long reads]
B --> E[K-mer Graphs<br/>→ Tutorial 1]
C --> F[Qualmer Graphs<br/>→ Tutorial 2]
D --> G[Hybrid Assembly<br/>→ Tutorial 3]By Experience Level
Beginner - "I just want it to work"
→ Start with Assembly in 5 Minutes
- Automatic parameter selection
- Simple one-function interface
- Clear output interpretation
Intermediate - "I want to understand and optimize"
→ Continue to Understanding Assembly Methods
- Compare different approaches
- Tune for your specific needs
- Interpret quality metrics
Advanced - "I want full control"
→ Explore Advanced Assembly Theory
- Custom graph algorithms
- Machine learning integration
- Novel method development
Each Method
Intelligent Assembly
- Automatic parameter optimization
- Good for unknown data characteristics
- Balances speed and accuracy
Iterative Assembly
- Best for low-quality data
- Refines assembly through iterations
- Takes more time
Reinforcement Learning Assembly
- Learns from experience
- Adapts to new data types
- Experimental feature
Next Steps
Ready to Start?
- Install Mycelia - Multiple installation options
- Run the quick start tutorial - See it in action
- Choose your workflow - Find the best approach
Need Help?
- FAQ - Common questions and troubleshooting
- Community Forum - Ask questions
- Report Issues - Help us improve
Want to Learn More?
- Mathematical Foundations - The theory behind the methods
- Benchmarks - Performance comparisons
- Case Studies - Real-world applications
Why Mycelia?
Scientific Innovation
- 6-graph hierarchy - Unifies multiple assembly approaches
- Quality preservation - Maintains quality throughout assembly
- Principled algorithms - Based on proven statistical methods
Practical Benefits
- No parameter tuning - It should just work
- Transparent results - Data-driven accuracy and completeness
- Active development - Regular updates and improvements
Open Source
- Free to use - MIT licensed
- Community driven - Contributions welcome
- Transparent - All algorithms documented
Ready to get started? Begin with the quick start tutorial →