Did data drift in AI models cause the Equifax credit score glitch?

Earlier this year, from March 17 to April 6, 2022, a systems issue at credit reporting agency Equifax caused incorrect consumer credit scores to be reported.

The issue was described by Equifax as a ‘coding issue’ and has led to legal claims and a class action lawsuit against the company. There has been speculation that the issue was somehow related to the company’s AI systems that help to calculate credit scores. Equifax did not respond to a request for comment on the issue from VentureBeat.

“When it comes to Equifax, there is no shortage of finger-pointing,” Thomas Robinson, vice president of strategic partnerships and corporate development at Domino Data Lab, told VentureBeat. “But from an artificial intelligence perspective, what went wrong appears to be a classic issue: errors were made in the data feeding the machine learning model.”

Robinson added that the errors could have come from any number of different situations, including labels that were updated incorrectly, data that was manually ingested incorrectly from the source or an inaccurate data source.

The risks of data drift on AI models

Krishna Gade, cofounder and CEO of Fiddler AI, speculated that another possible cause was a phenomenon known as data drift. Gade noted that, according to reports, the credit scores were sometimes off by 20 points or more in either direction, enough to alter the interest rates consumers were offered or to result in their applications being rejected altogether.

Gade explained that data drift can be defined as unexpected and undocumented changes to the structure, semantics and distribution of a model's data.
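To make the "distribution" part of that definition concrete, here is a minimal sketch of one common way to quantify a distribution shift, the population stability index (PSI). Nothing in the reporting says Equifax or Fiddler AI uses this measure; the function name, the bin count and the sample data are all assumptions made for illustration.

```python
import math
from typing import List, Sequence


def population_stability_index(
    reference: Sequence[float],
    production: Sequence[float],
    bins: int = 10,
) -> float:
    """Compare two samples of one feature; larger values mean a bigger shift."""
    lo, hi = min(reference), max(reference)
    width = (hi - lo) / bins
    if width == 0:
        width = 1.0  # all reference values identical; avoid dividing by zero

    def bucket_shares(values: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for v in values:
            # Values outside the reference range fall into the edge buckets.
            idx = min(max(int((v - lo) / width), 0), bins - 1)
            counts[idx] += 1
        # A small floor keeps the logarithm defined for empty buckets.
        return [max(c / len(values), 1e-6) for c in counts]

    ref = bucket_shares(reference)
    prod = bucket_shares(production)
    return sum((p - r) * math.log(p / r) for r, p in zip(ref, prod))


# Hypothetical feature values: training-time sample vs. a shifted live sample.
reference_sample = [float(i % 100) for i in range(1000)]
shifted_sample = [v + 30.0 for v in reference_sample]

psi_same = population_stability_index(reference_sample, reference_sample)
psi_shifted = population_stability_index(reference_sample, shifted_sample)
```

A widely quoted rule of thumb treats a PSI above roughly 0.25 as a significant shift, but that threshold is a convention rather than a standard, and a real monitoring setup would tune it per feature.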

He noted that drift can be caused by changes in the world, changes in the usage of a product, or data integrity issues, such as bugs and degraded application performance. Data integrity issues can occur at any stage of a product’s pipeline. Gade commented that, for example, a bug in the front-end might permit a user to input data in an incorrect format and skew the results. Alternatively, a bug in the backend might affect how that data gets transformed or loaded into the model.
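The front-end example Gade gives, a value entered in the wrong format, is the kind of problem a simple integrity check can catch before the data reaches a model. The sketch below is illustrative only: the field names, types and ranges are hypothetical and are not taken from any real credit-scoring pipeline.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class FieldRule:
    kind: type
    minimum: float
    maximum: float


# Hypothetical schema for two model inputs (assumed names and ranges).
RULES: Dict[str, FieldRule] = {
    "credit_utilization": FieldRule(float, 0.0, 1.0),
    "open_accounts": FieldRule(int, 0, 200),
}


def validate_record(record: Dict[str, Any]) -> List[str]:
    """Return a list of integrity problems; an empty list means the record passed."""
    problems: List[str] = []
    for name, rule in RULES.items():
        if name not in record:
            problems.append(f"{name}: missing")
            continue
        value = record[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, rule.kind):
            problems.append(
                f"{name}: expected {rule.kind.__name__}, got {type(value).__name__}"
            )
            continue
        if not rule.minimum <= value <= rule.maximum:
            problems.append(
                f"{name}: {value} outside [{rule.minimum}, {rule.maximum}]"
            )
    return problems


good = validate_record({"credit_utilization": 0.35, "open_accounts": 4})
wrong_format = validate_record({"credit_utilization": "35%", "open_accounts": 4})
wrong_scale = validate_record({"credit_utilization": 35.0, "open_accounts": 4})
```

Here a percentage typed as the string "35%" is flagged as a type problem, and the same value sent as 35.0 instead of 0.35 is flagged as out of range, which is the sort of silent rescaling that could otherwise skew a model's output.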

Data drift is not an entirely uncommon phenomenon, either.

“We believe this happened in the case of the Zillow incident, where they failed to forecast house prices accurately and ended up investing hundreds of millions of dollars,” Gade told VentureBeat.

Gade explained that, from his perspective, data drift incidents happen because the machine learning process of dataset construction, model training and model evaluation implicitly assumes that the future will be the same as the past.

“In effect, ML algorithms search through the past for patterns that might generalize to the future,” Gade said. “But the future is subject to constant change, and production models can deteriorate in accuracy over time due to data drift.”

Gade suggests that if an organization notices data drift, a good place to start remediation is to check for data integrity issues. The next step is to dive deeper into model performance logs to pinpoint when the change happened and what type of drift is occurring.
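As a rough sketch of the second step, pinpointing when a change happened, the snippet below scans a log of daily average model outputs and reports the first day that departs sharply from an earlier baseline. The log values, the seven-day baseline and the z-score threshold are all assumptions chosen for illustration, not details from Gade or Fiddler AI.

```python
from statistics import mean, stdev
from typing import Optional, Sequence


def first_shifted_day(
    daily_means: Sequence[float],
    baseline_days: int = 7,
    z_threshold: float = 3.0,
) -> Optional[int]:
    """Return the index of the first day whose value deviates from the
    baseline window by more than z_threshold standard deviations."""
    baseline = daily_means[:baseline_days]
    center = mean(baseline)
    spread = stdev(baseline)
    if spread == 0:
        spread = 1e-9  # flat baseline; any change will register as a shift
    for day in range(baseline_days, len(daily_means)):
        if abs(daily_means[day] - center) / spread > z_threshold:
            return day
    return None


# Hypothetical log of daily average scores; the jump of about 20 points
# starting at index 9 is invented for this example.
score_log = [700, 702, 699, 701, 698, 700, 701, 700, 699, 721, 722, 720]
shift_day = first_shifted_day(score_log)
stable_day = first_shifted_day(score_log[:9])
```

A fixed baseline with a z-score cutoff is only one simple option; gradual drift would call for a rolling window or a dedicated change-point method instead.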

“Model explainability measures can be very useful at this stage for generating hypotheses,” Gade said. “Depending on the root cause, resolving a feature drift or label drift issue might involve fixing a bug, updating a pipeline, or simply refreshing your data.”

Playtime is over for data science

There is also a need for the management and monitoring of AI models. Gade said that robust model performance management techniques and tools are important for every company operationalizing AI in their critical business workflows.

Robinson also emphasized the need for companies to keep track of their ML models and ensure they are working as intended.

“Playtime is over for data science,” Robinson said. “More specifically, for organizations that create products with models that are making decisions impacting people’s financial lives, health outcomes and privacy, it is now irresponsible for those models not to be paired with appropriate monitoring and controls.”
