Difference between revisions of "FlyBase:Using FTP Archives"

From FlyBase Wiki
Jump to navigation Jump to search
Line 3: Line 3:
  
  
FlyBase is no longer supporting the archive servers, which replicated the website as it existed at the time of archiving. However, all of the data behind every release is available at our FTP archives, which you can reach on the [https://flybase.org/downloads/archivedata Archived Releases] page, from the Downloads tab of the menu bar.
+
FlyBase is no longer supporting the archive servers, which replicated the website as it existed at the time of archiving. However, all of the data behind every release is available at our FTP archives, which you can reach on the [https://flybase.org/downloads/archivedata Archived Releases] page, from the Downloads tab of the menu bar. From there, scroll down to the Main Data Archives section:
  
  
[[File:MainFTPData.png|frameless|800px|The Main Data Archives section of the Archived Releases page]]
+
[[File:MainFTPData.png|frameless|left|800px|The Main Data Archives section of the Archived Releases page]]
 
 
  
 
==Genome archives==
 
==Genome archives==
 
[[File:Index_of_FTP_genomes_archive.png|left|thumb|100px|Index of the FTP genomes archive.]]
 
[[File:Index_of_FTP_genomes_archive.png|left|thumb|100px|Index of the FTP genomes archive.]]
 
[[File:FTP_archive_for_D_grimshawi_FB2017_01.png|right|thumb|200px|Index of the D. grimshawi, FB2017_01 archive.]]
 
[[File:FTP_archive_for_D_grimshawi_FB2017_01.png|right|thumb|200px|Index of the D. grimshawi, FB2017_01 archive.]]
The '''[http://ftp.flybase.net/genomes/ FTP genomes archive]''' link holds genomic sequence data from many Drosophilid species in [https://useast.ensembl.org/info/website/upload/gff.html GFF, GTF], and [https://www.ncbi.nlm.nih.gov/WebSub/html/help/fasta.html FASTA] files formats. These are organized by species, so if you are interested in a particular non-melanogaster Drosophilid, this is the easiest way to find data specific to that species.  
+
The '''[http://ftp.flybase.net/genomes/ FTP genomes archive]''' link holds genomic sequence data from many Drosophilid species, organized first by species and then by release.  If you are interested in a particular non-melanogaster Drosophilid, this is the easiest way to find all data specific to that species.  
  
 +
 +
Data includes [https://useast.ensembl.org/info/website/upload/gff.html GFF, GTF], and [https://www.ncbi.nlm.nih.gov/WebSub/html/help/fasta.html FASTA] files formats, as well as the [[FlyBase:Downloads_Overview#Postgres_Chado_Database_Dump|Chado-XML]] database files for that species and release. The '''dna''' folder contains unprocessed sequences as .raw files.
  
 
The fullname (e.g. Drosophila grimshawi) and four-letter FlyBase species abbreviation (e.g. Dgri) are different folders, but contain the same files within them.
 
The fullname (e.g. Drosophila grimshawi) and four-letter FlyBase species abbreviation (e.g. Dgri) are different folders, but contain the same files within them.
Line 19: Line 20:
  
 
Precomputed files were never made for non-melanogaster species, so they are not available here.
 
Precomputed files were never made for non-melanogaster species, so they are not available here.
 
 
 
  
  

Revision as of 19:08, 21 February 2024

This page will contain a help guide for using FTP archives.


FlyBase is no longer supporting the archive servers, which replicated the website as it existed at the time of archiving. However, all of the data behind every release is available at our FTP archives, which you can reach on the Archived Releases page, from the Downloads tab of the menu bar. From there, scroll down to the Main Data Archives section:


The Main Data Archives section of the Archived Releases page

Genome archives

Index of the FTP genomes archive.
Index of the D. grimshawi, FB2017_01 archive.

The FTP genomes archive link holds genomic sequence data from many Drosophilid species, organized first by species and then by release. If you are interested in a particular non-melanogaster Drosophilid, this is the easiest way to find all data specific to that species.


Data includes GFF, GTF, and FASTA files formats, as well as the Chado-XML database files for that species and release. The dna folder contains unprocessed sequences as .raw files.

The fullname (e.g. Drosophila grimshawi) and four-letter FlyBase species abbreviation (e.g. Dgri) are different folders, but contain the same files within them.


Precomputed files were never made for non-melanogaster species, so they are not available here.



Release archives

The FTP releases archive link holds data organized by release, for both D. melanogaster and non-melanogaster Drosophilids.

Between these two links, there are multiple redundant ways to reach the same file.