Share this page | Email | Contact Us

Special Report on

Petabyte-scale data management

petabyte scale data management special research report Photo by i.ytimg.com
A few weeks ago, I had the chance to visit eBay, meet briefly with Oliver Ratzesberger and his team, and then catch up later with Oliver for dinner. I’ve already alluded to those discussions in a couple of posts, specifically on MapReduce (which eBay doesn’t like) and the astonishingly great difference between high- and low-end disk drives (to which eBay clued me in). Now I’m finally getting around to writing about the core of what we discussed, which is two of the very largest data warehouses in the world. Metrics on eBay’s main Teradata data warehouse include: >2 petabytes of user data 10s of 1000s of ...
programs, PostgreSQL is not controlled by any single company, but has a global community of developers and companies to develop it.
REVIEWS AND OPINIONS
IBM Unveils Software and Services to Help Organizations Make Sense ...
To help clients transform their businesses through information management and analytics, IBM (NYSE: IBM) today announced new software and services designed to help organizations take advantage of the growing and diverse forms of data and content. To view the multimedia assets associated with this release, please click http://www.prnewswire.com/news-releases/ibm-unveils-software-and-services-to-help-organizations-make-sense-of-their-deluge-of-data-94227449.html The importance of business insight and analytics can be found in IBM’s 2010 Global CEO Study, which reveals how leading companies are using new approaches to ... market research, surveys and trends
[Dbworld] SSDBM 2010 in Heidelberg, Germany: Call for ...
Please do not post msgs that are not relevant to the database community at large. Go to www.cs.wisc.edu/dbworld for guidelines and posting forms. To unsubscribe, go to https://lists.cs.wisc.edu/mailman/listinfo/dbworld This entry was posted on Friday, May 14th, 2010 at 1:36 pm and is filed under Uncategorized . You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed. Comments are closed. market research, surveys and trends

SURVEY RESULTS FOR
PETABYTE-SCALE DATA MANAGEMENT

Notes on the Oracle Database 11g Release 2 white paper | DBMS2 ...
has evidently been edited, given that a phrase I quoted last month is no longer to be found. Anyhow, here are some quotes from and comments on what evidently is the latest version. The In-Memory Database Cache (IMDB Cache) option of Oracle Database 11g Release 2, allows data to be cached and processed in the memory of the applications themselves, off-loading the data processing to middle tier resources. Any network latency between the middle tier and the back-end database is removed from the transaction path, with the result that individual transactions can often be executed up to 10 times faster. This is particularly useful ... industry trends, business articles and survey research
Press Release: Data Warehousing, Financial Services Solutions ...
Sybase, Inc. (NYSE: SY), a leading provider of enterprise infrastructure and mobile software, today announced in conjunction with Sun Microsystems and BMMSoft, that Sybase IQ powers the world’s largest data warehouse implemented in history, as noted in the independently audited report, “Sun Data Warehouse Reference Architecture for Structured and Unstructured Data,” August 2007. The significant benchmarks were achieved in large part due to the unique compression capability of Sybase IQ, a highly optimized analytics server not only for compression but also for query performance and lightning speed load ... industry trends, business articles and survey research
RELATED NEWS
A cost-effective approach for petabyte storage systems
This vendor-written tech primer has been edited by Network World to eliminate product promotion, but readers should note it will likely favor the submitter's approach. The onslaught of unstructured digital content -- video, audio and images -- is taxing storage systems and creating the need to be able to store multi-petabytes, but current industry practices using RAID and replication to accomplish data protection are expensive at this scale. Dispersal, a new approach, is cost effective for petabytes of digital content storage. Further, it provides extraordinary data protection, meaning digital assets will not be ... market trends, news research and surveys resources
Simplest Ethernet storage validated
Coraid's simpler-than-iSCSI Ethernet storage protocol has been validated by ESG which found it could install and use it in less than two minutes, and get better-than Fibre Channel performance at a fifth of the cost. Coraid's EtherDrive SAN storage uses the lightweight ATA-over-Ethernet (AoE) protocol to link servers and storage arrays using standard Ethernet switches. This protocol ensures lossless delivery of data packets without involving upper level network stack processes such as the TCP/IP ones used by iSCSI. The ESG report (pdf) describes hands-on testing of the product in a virtualised server environment. It ... market trends, news research and surveys resources

INFORMATION RESOURCES

WinterCorp Research Discusses the Top 10 Features of - WinterCorp ...
terabyte- and petabyte-scale data management systems throughout their lifecycle. Since our inception in 1992, we have architected ... technology research, surveys study and trend statistics
Building the Teraflops/Petabytes Production Supercomputing Center
scientific data management. NERSC has developed a data-intensive computing ..... Ian Foster: Large-Scale Data Grids. DOE Conference on High Speed Computing, ... technology research, surveys study and trend statistics
Petabyte-Scale Storage @ SSRC
We are investigating the construction of large-scale storage systems using object-based storage devices (OSDs). An OSD is a network-attached storage device that presents an interface of arbitrarily-named data objects of variable size rather than sequentially numbered fixed-size blocks, to deal with the data storage details, such as request scheduling and data layout. Metadata is managed separately by one or more specialized metadata servers (MDSs), which is critical to scalability, reliability and security. The separation of data and metadata storage and management provides very high access bandwidth to the large-scale ...
REAL TIME
PETABYTE-SCALE DATA MANAGEMENT
latest webinars
  1. SEPATON / Storage Strategies NOW to Present March 16 Webinar on ...
  2. SYS-CON Webcast
Join these Webinars to learn more about current research, trends and surveys.
QUESTIONS AND ANSWERS
Slashdot | Building a Massive Single Volume Storage Solution?
"I've been asked to build a massive storage solution to scale from an initial threshold of 25TB to 1PB, primarily on commodity hardware and software. Based on my past experience and research, the commercial offerings for such a solution becomes cost prohibitive, and the budget for the solution is fairly small. Some the technologies that I've been scoping out are iSCSI , AoE and plain clustered/grid computers with JBOD (just a bunch of disks). Personally I'm more inclined on a grid cluster with 1GB interface where each node will have about 1-2TB of disk space and each node is based on a 'low' power ...
What's The Pros and Cons Of Using Ethernet As The Backbone Of Your ...
I'm looking for honest professional opinions of the benefits AND drawbacks of using etherent (e.g. GigE, etc.) as the backbone of a company communication network (voice AND/OR data). If you have personal experience as a user OR provider please share lessons learned. I'm looking for more thought and detail than just "it's cheaper" ... please explain and support your opinions with examples where possible. This is purely a hypothetical question. If you need more details fill them in yourself as you deem approrpriate (e.g. applications, size company, number of users, existing infrastructure, etc.). Just make that ...