Starfish Helps Tame the Wild West of Massive Unstructured Data

“What data do you have? And can I access it?” Those may seem like simple questions for any data-driven enterprise. But when you have billions of files spread across petabytes of storage on a parallel file system, they actually become very difficult questions to...

The Gruesome Job of Managing Petabytes of Scientific Data

“Oh, wow, the IT department charges us $11,000 a month for data storage. Why don’t we put some data on cheaper storage? Mark, why don’t you look into that?” the principal investigator asks the postdoc who just joined the team. There are thousands of similar...