Main Page: Difference between revisions

From ReddNet
Jump to navigation Jump to search
No edit summary
Line 1: Line 1:
__NOTOC__
__NOTOC__


== What is {{Template:REDDnet}}? ==
== {{Template:REDDnet}}:  Enabling Data Intensive Science in the Wide Area ==


{|
{|
|- valign="top"
|- valign="top"
|{{Template:REDDnet}} (Research and Education Data Depot network) is an NSF-funded data collaboration project.  When completed REDDnet will provide an extremely large (>700 Terabytes) distributed storage facility with a core focus of scalability, speed and fault tolerance. Currently (Spring 08), there are roughly 160 TB deployed.
|{{Template:REDDnet}} (Research and Education Data Depot network) is an NSF-funded project to provide a large distributed facility with a core focus of data intensive collaboration in the wide area  Our mission is to provide working storage, not archival or long-term personal storage, for research collaborations who are either trying to move data from one collaborating institution to another or to allow these collaborations to share large data sets for brief periods of time (a few months) while they work on it.
 
One example might be the [http://cms.cern.ch/ CMS] collaboration, a high energy physics experiment that will be taking data soon at the Large Hadron Collider (LHC) at [http://public.web.cern.ch/ CERN].  Groups of researchers, distributed across the country and the world, will want to use data products derived from the raw data produced by collisions in the LHC to do a variety of tasks from calibrating the detector to searching for new physics.  They will want these data products available for anywhere from a month to a few months, and they would benefit greatly if this data could be available for processing on their distributed computing infrastructure and in particular on the [http://www.opensciencegrid.org/ Open Science Grid].
 
Another example, from the [http://www.americaview.org/ AmericaView] project, might occur in the aftermath of an earthquake in California or a Huricane on the Gulf Coast, where reasearchers across the country will want access to the geospatial image data
from satellites covering the affected regionFor a few months after the event, this data could be uploaded to REDDnet and made available to this community.
 
 
Initially, REDDnet will deploy >700 Terabytes of distributed storage with an emphasis on scalability, speed and fault tolerance. Currently (Spring 08), there are roughly 160 TB deployed.


For example, at the
For example, at the

Revision as of 09:24, 6 May 2008


REDDnet: Enabling Data Intensive Science in the Wide Area

REDDnet (Research and Education Data Depot network) is an NSF-funded project to provide a large distributed facility with a core focus of data intensive collaboration in the wide area Our mission is to provide working storage, not archival or long-term personal storage, for research collaborations who are either trying to move data from one collaborating institution to another or to allow these collaborations to share large data sets for brief periods of time (a few months) while they work on it.

One example might be the CMS collaboration, a high energy physics experiment that will be taking data soon at the Large Hadron Collider (LHC) at CERN. Groups of researchers, distributed across the country and the world, will want to use data products derived from the raw data produced by collisions in the LHC to do a variety of tasks from calibrating the detector to searching for new physics. They will want these data products available for anywhere from a month to a few months, and they would benefit greatly if this data could be available for processing on their distributed computing infrastructure and in particular on the Open Science Grid.

Another example, from the AmericaView project, might occur in the aftermath of an earthquake in California or a Huricane on the Gulf Coast, where reasearchers across the country will want access to the geospatial image data from satellites covering the affected region. For a few months after the event, this data could be uploaded to REDDnet and made available to this community.


Initially, REDDnet will deploy >700 Terabytes of distributed storage with an emphasis on scalability, speed and fault tolerance. Currently (Spring 08), there are roughly 160 TB deployed.

For example, at the Supercomputing 2006 Conference in Tampa, Florida, REDDnet demonstrated sustained transfers at a rate of 10 Gigabits per second between Caltech and the convention floor. These transfers were limited by the bandwidth of the network connection. At the same conference, REDDnet demonstrated fault tolerance by striping data across thirty depots and then successfully reading the data even after turning off nine of these depots.

Reddnetmap.gif

Research Projects Using REDDnet

  • AmericaView - Satellite remote sensing data and technologies in support of applied research, K-16 education, workforce development, and technology transfer.
  • CMS - Elementary Particle Physics at the CERN Large Hadron Collider.
  • Structural Biology - Image reconstruction of large macromolecular assemblies through a collaborative effort of Vanderbilt and Lawrence Berkeley National Laboratory researchers.
  • Retinopathy - Diabetic Eye Disease Screening in Peru and Bolivia


Collaborators

Core Institutions


Error creating thumbnail: Unable to save thumbnail to destination
Tennessee
Stephen F. Austin
Error creating thumbnail: Unable to save thumbnail to destination
Error creating thumbnail: Unable to save thumbnail to destination
N. C. State
Delaware
Vanderbilt Tennessee S. F. Austin ORNL Nevoa Networks N. C. State Delaware


Collaborating Host Institutions


USP
UERJ
Michigan
Florida
Fermilab
Caltech
São Paulo Rio de Janeiro Michigan Florida Fermilab Caltech


AMPATH
LOC
LOC
Error creating thumbnail: Unable to save thumbnail to destination
LOC
AMPATH FIU Library of Congress SDSC Stanford UCSB

Support

NSF.gif This work is supported by NSF Grant PHY-0619847 and by the Vanderbilt Center for the Americas