Welcome to DeepRVAT’s documentation!
Rare variant association testing using deep learning and data-driven gene impairment scores.
_Coming soon:_ Overview of the DeepRVAT methodaster
How to use this documentation
First, see Installation.
Visit Quick start if you want to skip the detailed documentation and dive right in, or if you want to quickly check that the package installed correctly.
To run DeepRVAT on your data, first consult Modes of usage then visit the section relevant to your use case.
For all modes, you’ll want to consult Input data and configuration.
Note also that for all modes of usage other than association testing with precomputed gene impairment scores, you’ll need to preprocess your genotype data, followed by annotating your variants.
To train custom DeepRVAT models, rather than using precomputed gene impairment scores or our provided pretrained models, you’ll need to additionally run seed gene discovery. See also the Practical recommendations for training.
Finally, consult the relevant section for your use case here.
If running DeepRVAT on a cluster (recommended), some helpful tips are here.
Citation
If you use this package, please cite:
Clarke, Holtkamp et al., “Integration of Variant Annotations Using Deep Set Networks Boosts Rare Variant Association Testing.” Nature Genetics. https://www.nature.com/articles/s41588-024-01919-z
Contact
To report a bug or make a feature request, please create an issue on GitHub.
Contents:
- Installation
- Quick start
- Modes of usage
- DeepRVAT preprocessing pipeline
- DeepRVAT annotation pipeline
- Seed gene discovery
- DeepRVAT configuration and input data
- Association testing with precomputed gene impairment scores
- Association testing using pretrained DeepRVAT models
- Training and association testing
- Output file formats
- Cluster execution
- Practical recommendations for training
- Applying DeepRVAT to UK Biobank data
- API Reference