AI, Machine Learning Can Tackle Dirty Data
At this year's Waters USA event, panelists discussed the benefits of machine learning and AI, and where these technologies are still lacking.
Data is described as a "sea," a "firehose," and a "tsunami." There's a lot of data and as the ability to take in and store information has improved, the ability to analyze the growing inflow of data is the greatest challenge faces firms, whether for finding trading opportunities, managing risk or for regulatory reporting.
While everyone wants clean data, a lot of value exists in dirty, unstructured data. That's where Marc Alvarez, chief data officer at Mizuho Securities, believes that artificial intelligence—and, by extension, machine learning—can help. Rather than worrying so much about clean and dirty, use AI to sift through that sea of/firehose of/tsunami of information to direct the user to useful data hiding in the mud.
"I don't talk about clean data," said Alvarez, speaking at the Waters USA conference in Manhattan. "We talk about data in the terms of control and not in control. We don't prevent data from going anywhere. The reality is that today, as the business evolves we are becoming a very quantitative- and derivative-driven world. So we end up with these relatively sophisticated methods of moving money in and scaling it up for our purposes. By definition, all the data can't be right, nor should you under any circumstances attempt to make it all perfectly shiny and new."
He said it's best to leave it up to the business to decide what is good or bad data. Where firms should focus their investment is in skills around quantitative and statistical analyses.
"The reality of statistics is that we've been interpolating missing values and time series forever," he said. "We use statistics to measure the distribution and probabilities of populations—by definition it is not perfect, and this is where the strength of AI, in particular, is useful because it leverages those abilities."
Establishing new controls and understanding what the data is being used for is also important.
"But artificial intelligence is coming—let's be frank about it," Alvarez said. "If it's a better mousetrap, it's going to find its place and scale up. So what that means is, what are the controls in the organization? These are going to be controls that we haven't had before; they're going to be very different—moving to real-time evaluation of buy and sell, real-time profitability analysis, maybe even predictive profitability analysis. So it's not just about the technology and it's not just about the data. It's actually going to drive more of a systemic organizational change."
Getting Buy-In
Alvarez said that at Mizuho, senior management on the business side, working with IT and operations, has driven the use of AI in the firm. The business poses certain questions and asks for ideas to solve for.
"People talk about how financial firms are turning into technology companies. Well guess what the core competency of a technology company is: telling your management what to invest in and what they're going to get for their investment."
Charles Fiori, a consultant with 30 years of experience in the finance space, said that since it's an organizational shift, "the impetus has to come from the top; there has to be sponsorship from the upper levels of the organization and it has to be made clear that making this work is a priority for the company," Fiori said.
Still Concerns
Joseph Lodato, global head of compliance technology and surveillance at Guggenheim Securities, said getting that buy-in can have challenges.
"The problem with something coming from the top is that you have to go and climb that ladder," he said. He added that where firms have to be careful is that when pitching too many options, the process can become much more complicated than necessary. "It's like being in an app store and there are 10,000 apps," Lodato said.
Alvarez added an extra caveat: AI and machine learning is not going to solve all problems.
"Throwing automation at this stuff—especially the big datasets and datasets that are intermingled with relatively structured/unstructured datasets—it has a habit of creating an awful lot of false positives," he said. "So the business is going to approach this in a very conservative fashion."
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Data Management
New working group to create open framework for managing rising market data costs
Substantive Research is putting together a working group of market data-consuming firms with the aim of crafting quantitative metrics for market data cost avoidance.
Off-channel messaging (and regulators) still a massive headache for banks
Waters Wrap: Anthony wonders why US regulators are waging a war using fines, while European regulators have chosen a less draconian path.
Back to basics: Data management woes continue for the buy side
Data management platform Fencore helps investment managers resolve symptoms of not having a central data layer.
‘Feature, not a bug’: Bloomberg makes the case for Figi
Bloomberg created the Figi identifier, but ceded all its rights to the Object Management Group 10 years ago. Here, Bloomberg’s Richard Robinson and Steve Meizanis write to dispel what they believe to be misconceptions about Figi and the FDTA.
SS&C builds data mesh to unite acquired platforms
The vendor is using GenAI and APIs as part of the ongoing project.
Aussie asset managers struggle to meet ‘bank-like’ collateral, margin obligations
New margin and collateral requirements imposed by UMR and its regulator, Apra, are forcing buy-side firms to find tools to help.
Where have all the exchange platform providers gone?
The IMD Wrap: Running an exchange is a profitable business. The margins on market data sales alone can be staggering. And since every exchange needs a reliable and efficient exchange technology stack, Max asks why more vendors aren’t diving into this space.
Reading the bones: Citi, BNY, Morgan Stanley invest in AI, alt data, & private markets
Investment arms at large US banks are taken with emerging technologies such as generative AI, alternative and unstructured data, and private markets as they look to partner with, acquire, and invest in leading startups.