How Goldman Sachs Looked to Google To Reinvent its Data Management Strategy

goldman-sachs-new — Goldman Sachs adopted innovative technologies from Google and Amazon to tackle its data challenges.

- By Aggelos Andreou
- @AggelosAndreou
- 27 Nov 2017

Tweet
Facebook
LinkedIn
Save this article
Send to
Print this page

One of the biggest challenges facing financial services firms today is not necessarily the amount of data they have to manage and store, but how much data they end up throwing away, given the fact that over the last couple of years they have produced increasingly large data volumes.

At this year’s WatersTechnology Innovation Summit in London, Goldman Sachs’ head of technology, Joanne Hannaford, said that in order to tackle this challenge and create better data flows, organizations need to start thinking about their data in a more innovative, entrepreneurial way.

Hannaford explained that Goldman Sachs had to rethink its data strategy, given that the firm employs around 32,000 people globally, of which around 8,000 are engineers/computer scientists that work with and produce enormous datasets. “We have all the issues companies face in terms of dealing with data at scale,” she said. “We produce billions of lines of data and have large numbers of datasets in an extremely distributed way.”

Hannaford said that one of the biggest issues facing large corporations is how organizational boundaries and different divisions within them make it difficult to ask questions about data broadly, and then access the data without having to find it, put it into a database, and create ownership of it.

“Historically, what we’ve done with data is put it into databases and these spaces required you to define your data upfront,” Hannaford said. “We had to understand how to deal with that data before we even got to the point of finding it.”

Creating Databases

According to Hannaford, Goldman Sachs resorted to creating new databases for the sake of flexibility, which resulted in the bank ultimately running 75,000 databases, in which it was impossible to cross-reference that data and introduce a common identifier for its business trades and transactions. The bank found the solution to that challenge by learning from a pair of trailblazers in consumer industries: Google and Amazon.

“If you think about Google or Amazon, they have created database technologies, which make us completely rethink how we store data,” Hannaford said. “They have built database technologies that allow you to quickly assemble data; it’s more like a directory structure, where you can copy a file from one directory to another.” She explained that this strategy makes it easier to ingest data into a database.

“Think about how Amazon reinvented itself,” she said. “It took its engineering department and made it from an expense into a revenue [stream], by adapting the technology it was developing for itself and applying it [to address its data challenges].”

Another challenge that needed to be addressed, according to Hannaford, was to solve the bank’s data query issue. Goldman Sachs opted to build a data lake underpinned by artificial intelligence technology, borrowed from Google’s semantic web, which sits underneath the firm’s Chrome web browser.

“That allowed us to take data from all different sources and perform queries and ask questions [of it],” she said. “We defined the data upfront and we linked it to our sources, which gave us the ability to make sense and understand different questions from our data.”

Hannaford is firm in her belief that data tends to be an undervalued commodity across the capital markets, with large numbers of firms resorting to “giving it away,” at the expense of managing it efficiently and logically and treating it as one of its most valued possessions. “The lessons I’ve learned is that data is a first-class asset,” she explained. “It is the most valuable thing companies have. If you really value something, you understand who owns it and you understand how good it is. That’s why in financial services data is not really valued. For years we have been giving it away and we haven’t commoditized it.”

Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.

To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe

You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.

You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.

As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.

If you would like to purchase additional rights please email info@waterstechnology.com

You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.

If you would like to purchase additional rights please email info@waterstechnology.com

More on Data Management

New working group to create open framework for managing rising market data costs

Substantive Research is putting together a working group of market data-consuming firms with the aim of crafting quantitative metrics for market data cost avoidance.

29 Aug 2024

Off-channel messaging (and regulators) still a massive headache for banks

Waters Wrap: Anthony wonders why US regulators are waging a war using fines, while European regulators have chosen a less draconian path.

28 Aug 2024

Back to basics: Data management woes continue for the buy side

Data management platform Fencore helps investment managers resolve symptoms of not having a central data layer.

27 Aug 2024

‘Feature, not a bug’: Bloomberg makes the case for Figi

Bloomberg created the Figi identifier, but ceded all its rights to the Object Management Group 10 years ago. Here, Bloomberg’s Richard Robinson and Steve Meizanis write to dispel what they believe to be misconceptions about Figi and the FDTA.

23 Aug 2024

SS&C builds data mesh to unite acquired platforms

The vendor is using GenAI and APIs as part of the ongoing project.

22 Aug 2024

Aussie asset managers struggle to meet ‘bank-like’ collateral, margin obligations

New margin and collateral requirements imposed by UMR and its regulator, Apra, are forcing buy-side firms to find tools to help.

20 Aug 2024

Where have all the exchange platform providers gone?

The IMD Wrap: Running an exchange is a profitable business. The margins on market data sales alone can be staggering. And since every exchange needs a reliable and efficient exchange technology stack, Max asks why more vendors aren’t diving into this space.

19 Aug 2024

Reading the bones: Citi, BNY, Morgan Stanley invest in AI, alt data, & private markets

Investment arms at large US banks are taken with emerging technologies such as generative AI, alternative and unstructured data, and private markets as they look to partner with, acquire, and invest in leading startups.

15 Aug 2024