This page is intended to provide cancer researchers with a variety of resources to aid in their research. This page will contain the following: Cancer sample databases, Cancer analysis tools, and Cancer Atlases.
Cancer Sample Databases
Here you will find various databases which users can search and download publicly available cancer data. If this is something you would like to do, i.e. download, process, and analyse raw sequencing data from a specific study, then please contact me! (GibbsA@cardiff.ac.uk)
Gene Expression Omnibus (GEO)
Public repository that archives and distributes high-throughput gene expression and other functional genomics data sets. It is maintained by the National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM). Link to the Overview. Link to the FAQ. Link on how to download data.
Array Express
The functional genomics data collection stores data from high-throughput functional genomics experiments, and provides data for reuse to the research community. Link to the Help section.
European Nucleotide Archive (ENA)
The European Nucleotide Archive provides a comprehensive record of the world’s nucleotide sequencing information, covering raw sequencing data, sequence assembly information and functional annotation. Link to the support guides page.
Sequence Read Archive (SRA)
Sequence Read Archive data, available through multiple cloud providers and NCBI servers, is the largest publicly available repository of high throughput sequencing data. SRA stores raw sequencing data and alignment information to enhance reproducibility and facilitate new discoveries through data analysis. how to search and download link.
Zenodo
Open research repository. Search for cancer and you get list of uploads associated with papers. Every file that the paper used for the publication is uploaded, such as loom files for RNA velocity analysis etc etc. Link to the Guides.
OmicsDI
Database of sequencing datasets. Home page has lots of quick stats and search bar. Allows user to find databases related to their search and access that data. Link to Help section.
Genomics Data Commons (GDC) Data Portal
Search for your favourite gene or cancer to get relevant statistics. There are a number of great tools on this site! Data is also downloadable for further analysis. Link to the Documentation.
Google Dataset Search Tool
Search for public datasets.
Link to the User Support Page
PanglaoDB
A database for scRNAseq data downloads, but also you can search a gene across datasets.
Link to FAQ/support page
Curated Atlas Query R
A query interface that allow the programmatic exploration and retrieval of the harmonised, curated and reannotated CELLxGENE single-cell human cell atlas. Must be run within R.
3CA - Curated Cancer Cell Atlas
Collected, annotated and analyzed cancer scRNA-seq datasets. Paper.
IMMUcan Database
A fully integrated scRNA-seq database exclusively dedicated to human cancer and accessible to nonspecialists. IMMUcan scDB encompasses 144 datasets on 56 different cancer types, annotated in 50 fields containing precise clinical, technological, and biological information.
JingleBells
A repository of standardized single cell RNA-Seq datasets for analysis and visualization at the single cell level.
scPortalen Database
This database features integration of single cell metadata, cell images and sequence information. Rhe platform is divided into two parts, the single cell dataset generated by the research team, and the single cell dataset published in peer-reviewed papers.
Cancer Single-cell Expression Map
Public database dedicating to collecting, analysing, and visualising single-cell RNAseq data of human cancers.
Database Usage link.
Cancer Analysis Tools
Here you will find links to various tools that allow the user to explore and analyse cancer data. These tools are mostly online, meaning users only need an internet browser and a stable internet connection. However, some are available as downloadable apps or can be run through R or Python. If this something you would like to do but are not sure how, please contact me! (GibbsA@cardiff.ac.uk)
Xena
This tool allows users to explore functional genomic data sets for correlations between genomnic and/or phenotypic variables. Users can choose a study to explore up to two variables. There is also a Transcripts tab which allows the user to visualise the various transcripts of a gene. Link to the Help page.
NCRAS - National Cancer Registration and Analysis Service
The National Cancer Registration and Analysis Service (NCRAS) collects, quality assures and analyses data on all people living in England who are diagnosed with cancer. It is part of the National Disease Registration Service (NDRS) in NHS Digital (NHSD). There are a plethora of tools on here to exploit this data.
Cancer Research UK - Cancer Statistics for the UK
CRUK have a nice, more basic tool on this website to get quick and simple cancer stats.
Cancer Research UK - Early Diagnosis Hub
CRUK also have an early diagnosis hub tool that allows exploration of the latest cancer early diagnosis data across the UK.
OncoLnc
Link TCGA survival data to mRNA, miRNA, or lncRNA expression levels. Link to tutorial video.
CIViC - clinical interpretation of variants in cancer database
Primarily a tool for doctors to get the best treatment options for patients with certain mutations. You can search for specific muations/gene variants and the database gives you a load of information on it. Can also look for specific cancers, specific therapies. Really neat database. Link on how to use CIVIC.
CrossHub
Multi-way analysis of RNAseq, miRNAseq and methylome data from the TCGA project. Python tool. Generates excel summaries.
Cellenics
Open source scRNAseq analysis software online tool to analyse public scRNAseq datasets. Its a cloud based tool and everything is done online. Need to sign up for free. Very neat tool. Click on the link and then the Cellenics tab. Link to the User Guide.
Single Cell Portal
Allows data exploration of curated datasets. scRNAseq and spatial datasets available! Link to Help Page.
UCSC Cell Browser
Interactive viewer for scRNAseq expression. How to use the website.
OmicsDI
Database of sequencing datasets. Home page has lots of quick stats and search bar. Allows user to find databases related to their search and access that data. Tutorial page.
Dependency Map (DepMap) portal
Open access to key cancer dependencies analytical and visualisation tools. Documentation page.
ISOexpresso
Isoform expression resource for isoform expression analysis in cancer.
TCGA Spliceseq
Cancer splicing visualised. FAQ page.
Genomics Data Commons (GDC) Data Portal
Search for your favourite gene or cancer to get relevant statistics. There are a number of great tools on this site! Data is also downloadable for further analysis. Link to the Documentation.
DriverDBv4
Database for human cancer driver gene research Link to Help Page.
List of PPI resources
A list of 375 protein-protein interaction resources were compiled through extensive literature search. Basic features, URL, publication date, and corresponding article(s) for each resource are also listed by browsing through their web-pages.
Cancer cell maps
Selected set of human cancer focused pathways.
Tumour-suppressor Gene Database (TSGene)
Comprehensive resource for pan-analysis of human tumour suppressor genes (TSGs). Link to Help Page.
COSMIC
Catalogue of somatic mutations in cancer. Link to Help Page.
Cancer Hotspots
Resource for statistically significant recurrent mutations in cancer.
CancerMine
Literature-mined database of drivers, oncogenes and tumour suppressors in cancer.
Cancer cell metabolism gene DB
Comprehensive annotation resource for cell metabolism genes in cancer. Link to Help Page.
DisGeNET
Discovery platform containing one of the largest publicly available collections of genes and variants associated to human diseases Link to Help Page.
Cancer 3D
Patterns of mutations in cancer. Database provides an open and user-friendly way to analyse cancer missense mutations in the context of structures of proteins they are found in and in relation to patients gener and age. Link to Tutorial.
GREIN
GEO RNAseq Experiments Interactive Navigator. Interactive web platform that provides user-friendly options to explore and analyse GEO RNAseq data.
DISCO - deeply integrated human single-cell omics data
Massive database. Three tools available on there too for scRNAseq data. CELLiD: cell type annotation, CellMapper:data transfer, and scEnrichment: GSEA. Link to Vignette/Tutorial.
cellTypist
It is a cell annotation tool but also have a database for the markers. Link to Tutorials.
Cellxgene
Download and/or visually explore reference-quality data to understand the functionality of human tissues at the cellular level with Chan Zuckerberg CELL by GENE Discover (CZ CELLxGENE Discover). Users can search for data collections, browse data sets, explore single cell gene expression, and obtain information about specific cell types. Really neat tool! Help & Documentations Page.
IMMUcan Database
A fully integrated scRNA-seq database exclusively dedicated to human cancer and accessible to nonspecialists. IMMUcan scDB encompasses 144 datasets on 56 different cancer types, annotated in 50 fields containing precise clinical, technological, and biological information.
scPortalen Database
This database features integration of single cell metadata, cell images and sequence information. Rhe platform is divided into two parts, the single cell dataset generated by the research team, and the single cell dataset published in peer-reviewed papers.
Cancer Single-cell Expression Map
Public database dedicating to collecting, analysing, and visualising single-cell RNAseq data of human cancers. Link to Help Page.
TISCH2: Tumour Immmune Single-cell Hub 2
Tumor Immune Single-cell Hub 2 (TISCH2) is a scRNA-seq database focusing on tumor microenvironment (TME). TISCH2 provides detailed cell-type annotation at the single-cell level, enabling the exploration of TME across different cancer types. Link to documentation Page.
Kaplan-Meier Plotter
The Kaplan Meier plotter is capable of assessing the correlation between the expression of all genes (mRNA, miRNA, protein, & DNA) and survival in 35k+ samples from 21 tumor types. Applied statistical tools include Cox proportional hazards regression and the computation of the False Discovery Rate. With 20,000 analyses per day, the KM-plotter is a worldwide reference for the discovery and validation of survival biomarkers.
ROC Plotter
The ROC plotter is capable of linking gene expression and therapy response using transcriptome-level data of breast, ovarian, and colorectal cancer patients and glioblastomas. The custom plotter generates a ROC plot for user-uploaded data.
muTarget
muTarget is a cancer biomarker / target discovery tool with two major functions
1) With a “Genotype” run one can identify gene(s) showing altered expression in samples harbouring a mutated input gene. This option is useful in case one searches new drug targets in a cohort of patients with a given mutation.
2) With a “Target” run one can identify mutations resulting in expression change in the input gene. This option is useful in case one has a drug target gene, and patient cohorts with enriched expression is the question.
Cancer Hallmarks
What are the Hallmarks of Cancer?
The “Hallmarks of Cancer” concept provides a framework for understanding fundamental organizing principles common to various cancers. Understanding hallmarks can benefit cancer prevention, diagnosis, and treatment development by summarizing functional and metabolic commonalities underlying malignant transformation and progression.
What can CancerHallmarks do?
We established a consensus list of cancer hallmark genes by merging 6,763 genes from available mapping resources. CancerHallmarks.com enhances the utility of the hallmark concept as an effective organizational tool by funneling genes to biological functions.
TNMplot: differential gene expression analysis in Tumour, Normal, and Metastatic tissues
Realy nice tool! Performs a whole range of pan-cancer analyses and provides multiple different outputs such as dot plots, bar charts, correlation plots, box plots and more!
Cancer Atlases
Here you will find links to various cancer atlases.
St Judes
St Judes has a collection of tools: Link to the Help Guides.
St Judes Genomics platform Has data browser - one of the wworlds most comprehensive repositories of pediatric cancer genomics data. Has analysis workflows to analyse the genomics data.
PECAN - curated pediatric cancer genomics data including variants, mutational signatures, and gene expression data in addition to histological slide images from ~9000 hematological, CAN, and non-CNS solid tumour patient samples.
Xenograft model systems tool - Explore patient derived xenograft data and cell lines generated at st Jude to enhance basic research and speed translation to the clinic.
Theres also a visualisation community that allows the creation and sharing of figures!
Human Cell Atlas (HCA)
A database of cellular reference maps with position, function and characteristics of every cell type in the human body. Heres the link to the data site. Link to the HCA guide.
Cambridge Cell Atlas
Portal of the human cell atlas. Analyses, visualises and provides tools for exploration of singlke cell RNAseq data generated by HCA. Nice tool.
Single Cell Expression Atlas
Run by the EBI. Can search genes, cell types, organs and diseases across 21 species, 355 studies and 10,505,726 cells. Link to the Help Page.
Curated Cancer Cell Atlas (3CA)
Collected, annotated, and analysed cancer scRNAseq datasets.
Cancer Specific Tools
Leukaemia Gene Database (LeGenD)
Database of leukaemia genes (LeGenD) developed to help biological and medical sciences community to easily access all information on the genes that are involved in leukaemia. Link to Help Page.
Pancreatic Cancer Gene Database (PCGDB)
Provides information on the genes that are involved in pancreatic cancer and this data is targeted to help the biological and medical sciences community for easier access of the latest information on genes causing pancreatic cancer. Link to Help Page.
Mesothelioma Cancer Advocacy
Mesothelioma.net is an advocacy and support group dedicated to providing all the latest in cancer research, treatment, and aid. Our team has worked diligently with health professionals to compile fact-checked and physician approved information regarding this disease and how it can be treated.