Download CanProVar Data
CanProVar provides the download of human protein database (Ensembl v79) in the fasta format, in which variation information is recorded in the header line of each sequence.
The README file explains the contents of the following files:
|
 
|
Description
|
Protein(FASTA)
|
Statistics
|
Validated dbSNP_nsSNPs
|
variation information from validated coding SNPs
|
dbSNP_validated_nsSNP_protein
|
967,017 |
Cancer related_nsSNPs |
mutations that have been reported in cancer samples
|
cancer_nsSNP_protein
|
156,671 |
Both
|
nsSNPs from both the dbSNP_validated* file and the cancer_* file |
all_nsSNP_protein |
1123,688 |
The csSNPs of each cancer type |
Cancer Name
|
Statistics
|
Protein(FASTA)
|
Adrenal Gland Neoplasms |
15 |
 |
Biliary Tract Cancer |
570 |
 |
Bone Neoplasms |
415 |
 |
Brain Cancer |
327 |
 |
Breast Cancer |
17460 |
 |
Central Nervous System Neoplasms |
5905 |
 |
Colorectal Cancer |
1127 |
 |
Esophageal Cancer |
2246 |
 |
Gastric Cancer |
6693 |
 |
Head and Neck Cancer |
9450 |
 |
Hepatocellular Carcinoma |
5143 |
 |
Intestines Cancer |
18090 |
 |
Leukemia |
8134 |
 |
Lung Cancer |
14382 |
 |
Lymphoma |
1110 |
 |
Melanoma |
11659 |
 |
Myeloproliferative Disorders |
53 |
 |
Neoplasms by Histologic Type |
1491 |
 |
Non-small cell lung carcinoma |
60 |
 |
Oral Cancer |
3 |
 |
Ovarian Cancer |
17303 |
 |
Pancreatic Cancer |
6353 |
 |
Parathyroid Carcinoma |
240 |
 |
Pituitary Carcinoma |
140 |
 |
Prostate Cancer |
1804 |
 |
Renal Cancer |
4183 |
 |
Sarcoma |
31 |
 |
Skin Cancer |
6137 |
 |
Small cell lung carcinoma |
34 |
 |
Testicular Cancer |
91 |
 |
Thyroid Carcinoma |
1804 |
 |
Urinary Bladder Cancer |
3718 |
 |
Uterine Cancer |
8594 |
 |
Vulva Cancer |
172 |
 |
acute lymphocytic leukemia |
6 |
 |
acute myeloid leukemia |
8 |
 |
breast ductal carcinoma |
23 |
 |
chronic lymphocytic leukemia |
198 |
 |
chronic myeloid leukemia |
21 |
 |
follicular thyroid carcinoma |
4 |
 |
pancreatic ductal adenocarcinoma |
11 |
 |
Download MS-CanProVar Data
 
|
Protein Sequence Database
|
MS-CanProVar (version 2.0) |
MS-CanProVar2
|
MS-CanProVar (version 2.0) is a protein sequence database that includes variation information to facilitate peptide variant detection in shotgun proteomics. In the .fasta file, each variant peptide is included as an independent entry; variations are annotated in the header line; variations are labeled as "rs" for SNPs and "cs" for cancer-related mutations. Please refer to A bioinformatics workflow for variant peptide detection in shotgun proteomics. Li et al., MCP, 2011 for details about the MS-CanProVar database. The current version of MS-CanProVar is based on Ensembl V79.
|
/*
Will use a file to keep the counts of the local and remote users
The files are called local_users.txt and remote_users.txt
When a user hits the page the ip address is checked against
the local subdomains.
Then the correct file is opened and the count pulled from the file
and incremented and then overwirte the same file with the new count
and the latest date.
Really do not find a need to keep this in a dbase.
*/
$local_sub_domain_one = "129.59";
$local_sub_domain_two = "160.129";
$local_sub_domain_three = "10.151";
$local_sub_domain_four = "10.0";
// grab the users IP address
if (@$_SERVER['HTTP_X_FORWARD_FOR']) {
$ip = $_SERVER['HTTP_X_FORWARD_FOR'];
} else {
$ip = $_SERVER['REMOTE_ADDR'];
}
// get the octets of the IP
@list($first_octet, $second_octet, $third_octet, $fourth_octet) = explode(".", $ip);
// This is the users subdomain we want to know if from Vanderbilt or remote
$subdomain = $first_octet;
$subdomain .= ".";
$subdomain .= $second_octet;
// If local subdomain print to local_users.txt else print to remote_users.txt
// this if statement is one line
// open the file and read the count into a variable trim it and increment it. Close this file and reopen the same file and write
// which overwirtes the same file with the new count with the latest date.
if ((strstr($subdomain, $local_sub_domain_one)) || (strstr($subdomain, $local_sub_domain_two)) || (strstr($subdomain, $local_sub_domain_three))|| (strstr($subdomain, $local_sub_domain_four))){
$fp = fopen("./stats/download_users.txt", "r") or exit("Unable to open file for read");
$count = fgets($fp);
$count = trim($count);
$time = @date("m\/d\/y"); // IMPORTANT Those are not Vs they are escape(backslash)/ (\ /) "m \ /d\ / but no space in code
fclose($fp);
$count++;
$fp = fopen("./stats/download_users.txt", "w") or exit("Unable to open file for write");
fwrite($fp, $count);
fwrite($fp, "\n");
fwrite($fp, $time);
fclose($fp);
$fp = fopen("./stats/local_users.txt", "r") or exit("Unable to open file for read");
$count = fgets($fp);
$count = trim($count);
$time = @date("m\/d\/y"); // IMPORTANT Those are not Vs they are escape(backslash)/ (\ /) "m \ /d\ / but no space in code
fclose($fp);
$count++;
$fp = fopen("./stats/local_users.txt", "w") or exit("Unable to open file for write");
fwrite($fp, $count);
fwrite($fp, "\n");
fwrite($fp, $time);
fclose($fp);
}
else
{
$fp = fopen("./stats/download_users.txt", "r") or exit("Unable to open file for read");
$count = fgets($fp);
$count = trim($count);
$time = @date("m\/d\/y"); // IMPORTANT Those are not Vs they are escape(backslash)/ (\ /) "m \ /d\ / but no space in code
fclose($fp);
$count++;
$fp = fopen("./stats/download_users.txt", "w") or exit("Unable to open file for write");
fwrite($fp, $count);
fwrite($fp, "\n");
fwrite($fp, $time);
fclose($fp);
$fp = fopen("./stats/remote_users.txt", "r") or exit("Unable to open file for read");
$count = fgets($fp);
$count = trim($count);
$time = @date("m\/d\/y"); // Again those are not Vs escape(using backslash) forward slash
fclose($fp);
$count++;
$fp = fopen("./stats/remote_users.txt", "w") or exit("Unable to open file for write");
fwrite($fp, $count);
fwrite($fp, "\n");
fwrite($fp, $time);
fclose($fp);
}
?>
@2015 Menghuan Zhang, Bing Zhang, Jing Li
|