When RBC Capital Markets launched its artificial intelligence-powered trading platform, Aiden, in the fall of 2020, it was the culmination of 10 years of active development work, as the firm sought to address common client issues such as slippage and alpha erosion while executing trades, and periods of volatility that could upend the best historical models.
Machine-learning and AI projects on Wall Street become outdated quickly as market conditions and technology evolve, keeping the makers of such technologies on their toes, and end-users in the ongoing throes of complex, onerous integrations, compliance checks, and training processes.
So when RBC Capital Markets set out to build a trading platform that could withstand the test of time and evolve with its surroundings, they turned to deep reinforcement learning, an advanced form of machine learning under the AI umbrella that remains underutilized in the capital markets.
Shary Mudassir, RBC’s co-head of global electronic trading, joined the firm in 2009 as an associate in the capital markets rotational program, testing his software engineering skills on different desks and gaining an affinity for electronic trading. He watched the space change drastically over nearly 12 years, as clients went from phoning brokers, to scouting sellers or buyers for their orders, to executing their own trades directly in the marketplace, which itself has become a complex network of lit venues, dark pools, off-exchange venues, alternative trading systems and dozens of other liquidity pools.
Ten years ago, the electronic trading desk zeroed in on crafting high-quality execution algorithms, building them internally from scratch. But it wasn’t until five years ago, when RBC Capital Markets partnered with Borealis AI, RBC’s research institute, that the concept of the Aiden algorithmic suite was conceived.
One of these algos, volume-weighted average price (VWap), was the first Aiden solution to go live last fall. With more than 150 of RBC’s largest clients now using it, the algo is meant to reduce slippage—the difference between the expected price of a trade and the price at which the trade is executed—against the VWap benchmark, which gives the average price a security has traded throughout the day, based on both volume and price. Slippage can happen fairly often, but especially so during periods of heightened volatility, he says.
“As we built those solutions over the past 10 years, there was significant growth in the complexity of the code base. And when you marry the complexity of the code base with the sheer dynamic nature of the market, you really need a very robust review and enhancement process to ensure that your execution algos will always be at a high-performing level,” Mudassir says. “Because the moment you roll out your execution algos and the intelligence within them you’ve coded in, market dynamics may change, market participants may change, others might pick up on your approach to executing.”
In past periods of market volatility, RBC would have to re-code its algorithms to adjust for new conditions, and continually review and tweak the algos for as long as the volatility lasted. And when that happened, the firm had to hope that those changes worked. It wasn’t an ideal system, Mudassir says.
So RBC and Borealis began developing their own deep reinforcement learning models to account for all the possible scenarios that can upend a trade—and even a market—so that the algos could become self-learning and autonomous.
The way Mudassir thinks about deep reinforcement learning is that it must work toward an end goal, but how the algo gets to a decision matters the most. Essentially, it isn’t about taking the best single action, but about taking the best possible series of actions—much like a chess game. In chess, “you make a lot of moves, each move can have points, but it’s the cumulative best moves by a player that lead to that person winning,” Mudassir says.
Though the rules are constant, every game of chess is slightly different. Like an algorithm, a player must retain some memory of previous games, while both adapting to changed or changing variables and forecasting for future variables yet unknown. It is exploitation (sureness of what is already known) and exploration (lower levels of confidence in the unknown) working in tandem.
With those components in mind, a skilled chess player or a sophisticated algorithm must consider that “a move right now may seem extremely great, but it may not be the best move because I’m working toward a future end goal,” Mudassir says. That’s the principle that guides Aiden.
The first real test of Aiden’s adaptability and durability arrived with the onset of the Covid-19 pandemic, which spurred the failure of countless historical models. Before the October launch, it had been in a beta trial with a number of large clients for months, who were able to preserve their performance despite overwhelming market uncertainty, without any manual intervention from RBC, Mudassir says.
But all the promise of deep learning comes with a catch. The more complex a model is, the bigger the issues of transparency and explainability. “While we may understand the outcome of the model, it may be difficult for us to fully understand the reasons that led to that outcome. And this is an area we needed to do something about,” Mudassir says.
So in tandem with Aiden, RBC built a secondary platform alongside it called Aiden Insights, a client-facing portal available via web or mobile devices, which offers users real-time visibility into the decisions Aiden makes on how it executes on their order flows. Mudassir hopes that Aiden Insights can be example of trust and transparency for the rest of the industry, especially those who are hesitant to use the technology due its opaqueness.
RBC has begun building off of its VWap algo, with a new release pegged for the fourth quarter of this year that is currently in testing with clients. The second algo targets another important benchmark—arrival price, which is the midpoint between the bid/ask prices at the time an order is placed. It can be difficult to hit that midpoint because as a trader starts buying up certain shares, sellers can get a sense of that movement, causing the price to rise and effectively minimizing the alpha that a portfolio manager wants to capture for their own clients.
Though RBC’s live or in-testing implementations of deep reinforcement learning are currently limited to execution, the firm is considering how it can be used to navigate the fragmented network of liquidity pools and how to best deliver timely and consumable information to clients.
“[Reinforcement learning] is definitely the next big evolution. That’s how we see it, and that’s why we’re so committed to it,” Mudassir says. “No environment is more dynamic and complex and ever-changing than the stock market. Our clients rely on us to help them execute their trades in these markets. And that’s why we felt that this would be a perfect fit for RL. And it’s just been tremendous to see that hypothesis prove out over the last couple years.”
Further reading
Only users who have a paid subscription or are part of a corporate subscription are able to print or copy content.
To access these options, along with all other subscription benefits, please contact info@waterstechnology.com or view our subscription options here: http://subscriptions.waterstechnology.com/subscribe
You are currently unable to print this content. Please contact info@waterstechnology.com to find out more.
You are currently unable to copy this content. Please contact info@waterstechnology.com to find out more.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@waterstechnology.com
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@waterstechnology.com
More on Emerging Technologies
This Week: Startup Skyfire launches payment network for AI agents; State Street; SteelEye and more
A summary of the latest financial technology news.
Waters Wavelength Podcast: Standard Chartered’s Brian O’Neill
Brian O’Neill from Standard Chartered joins the podcast to discuss cloud strategy, costs, and resiliency.
SS&C builds data mesh to unite acquired platforms
The vendor is using GenAI and APIs as part of the ongoing project.
Chevron’s absence leaves questions for elusive AI regulation in US
The US Supreme Court’s decision to overturn the Chevron deference presents unique considerations for potential AI rules.
Reading the bones: Citi, BNY, Morgan Stanley invest in AI, alt data, & private markets
Investment arms at large US banks are taken with emerging technologies such as generative AI, alternative and unstructured data, and private markets as they look to partner with, acquire, and invest in leading startups.
Startup helps buy-side firms retain ‘control’ over analytics
ExeQution Analytics provides a structured and flexible analytics framework based on the q programming language that can be integrated with kdb+ platforms.
The IMD Wrap: With Bloomberg’s headset app, you’ll never look at data the same way again
Max recently wrote about new developments being added to Bloomberg Pro for Vision. Today he gives a more personal perspective on the new technology.
LSEG unveils Workspace Teams, other products of Microsoft deal
The exchange revealed new developments in the ongoing Workspace/Teams collaboration as it works with Big Tech to improve trader workflows.