NAFIS 2017: Firms Dip Their Toes into Data Lakes for Analytics
Panelists at the North American Financial Information Summit described the transformative potential of “data lakes” for supporting firms’ ever-increasing analytics and alternative data storage requirements. Joanne Faulkner reports.
Data warehouses in firms today have traditionally grown within business silos, or may have come from acquisitions, said Thomas Angelo, digital collaboration services director at BNY Mellon, speaking on a panel at the North American Financial Information Summit held in New York on May 24. “The problem is, they’re very difficult to scale. If you want to add a new business to a warehouse, it’s difficult to add [because that warehouse is typically] a best-of-breed developed for one solution. But to get that enterprise view across a firm to better identify business opportunities… you need to unify [those warehouses]—and that’s where data lakes come in. They are a unified way for applications to access data across that lake,” meaning that anyone within an organization who is interested in that data can use it to run analytics that inform their decision-making process.
This means that the data lake architecture is often attractive to high-level business executives at firms “because you just dump all your data in raw format into one place” that doesn’t require the same amount of upfront work as data warehouses, said Jean-Pascal Chauvet, chief technology officer for Deutsche Bank’s equities business. However, a lot of work is needed to make use of the data stored in the lake, requiring data scientists who understand a firm’s business, he said.
Another disadvantage is that data tipped into the lake is stored there without context. “You don’t have the lineage of the data, why it exists, and how it came to be,” Chauvet said, adding that there are also access control issues. “If you dump all your data in one place, how do you control the access, if you don’t know the context around the data? In this way, data lakes can very quickly become data swamps.”
Wells Fargo thinks of data lakes as “the ‘place’ where our businesses and those who interrogate the data in the lines of business will now have the opportunity to be able to go to this unified ‘ecosystem’ that pulls together their business-respective data in a common place, in a common way, governed and structured…. It’s not an easy thing to pull together,” said Stephen Harris, senior vice president, head of enterprise data strategy at the bank.
But Harris also warned that a data lake is not a quick, easy solution to all of a firm’s data challenges. “A data lake is not a silver bullet…don’t be fooled by the concept that the data lake is going to be the be-all answer. It’s another approach you can take to servicing the insight needs of your respective business partners,” requiring “multiple drawers” to manage structured and unstructured data. “The lake is an approach, a framework; it will evolve to become a technology ecosystem.”
While applications for data lakes are still evolving, panelists agreed that one successful use case for data lakes is internal and external fraud detection. “You need to be able to marry trading data with entitlement data—who has access to what— with records such as security pass swipes. On top of that you need to apply on an algorithm to detect anomalies,” Chauvet said.
Using data lakes for this purpose at Deutsche Bank has proved “somewhat successful,” though this approach does not yet work on the trading floor to provide real-time analytics that detect patterns and signals in real time, or for risk, where the bank found it was easier to normalize risk across the different business divisions and then have a data warehouse. But Chauvet hasn’t given up hope: “This is an area which is evolving very quickly and the techniques are evolving very quickly. As we learn, we may be able to make those cases work,” he said.
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Data Management
New working group to create open framework for managing rising market data costs
Substantive Research is putting together a working group of market data-consuming firms with the aim of crafting quantitative metrics for market data cost avoidance.
Off-channel messaging (and regulators) still a massive headache for banks
Waters Wrap: Anthony wonders why US regulators are waging a war using fines, while European regulators have chosen a less draconian path.
Back to basics: Data management woes continue for the buy side
Data management platform Fencore helps investment managers resolve symptoms of not having a central data layer.
‘Feature, not a bug’: Bloomberg makes the case for Figi
Bloomberg created the Figi identifier, but ceded all its rights to the Object Management Group 10 years ago. Here, Bloomberg’s Richard Robinson and Steve Meizanis write to dispel what they believe to be misconceptions about Figi and the FDTA.
SS&C builds data mesh to unite acquired platforms
The vendor is using GenAI and APIs as part of the ongoing project.
Aussie asset managers struggle to meet ‘bank-like’ collateral, margin obligations
New margin and collateral requirements imposed by UMR and its regulator, Apra, are forcing buy-side firms to find tools to help.
Where have all the exchange platform providers gone?
The IMD Wrap: Running an exchange is a profitable business. The margins on market data sales alone can be staggering. And since every exchange needs a reliable and efficient exchange technology stack, Max asks why more vendors aren’t diving into this space.
Reading the bones: Citi, BNY, Morgan Stanley invest in AI, alt data, & private markets
Investment arms at large US banks are taken with emerging technologies such as generative AI, alternative and unstructured data, and private markets as they look to partner with, acquire, and invest in leading startups.