Industry: Financial Services
Use Case: Risk Management & Fraud Detection

A global digital fraud and identity-authentication service, LexisNexis is an online portal accessed by thousands of users worldwide. The portal serves over 5,000 global brands, helping them verify over 84 billion transactions yearly. Yellowbrick dramatically improves the performance and reliability of a critical fraud detection application.

Overview

LexisNexis® ThreatMetrix, is a leader in global digital fraud detection and identity authentication services. Central to its operations is the LexisNexis Digital Identity Network (DIN) powered by a sophisticated ML model. This network, serves over 5,000 brands in 244 countries, illustrating the company’s extensive global reach and influence in digital security.

Key Statistics and Operation

  • The DIN processes over 8 billion transactions monthly across 8.2 billion devices.
  • The system streams 200+ data points and calculates 1,000 extra properties for each transaction, all within an average time of less than 60 milliseconds.
  • Clients utilize a 300TB multi-tenant database over 25,000 times daily, integrating up to 1TB of new data from a data lake via Kafka.
  • The platform adeptly handles complex, simultaneous queries from hundreds of users, accessing data across months and millions of records.

Challenges faced by business users

LexisNexis’s data pipeline was initially built using various technologies, including Apache Kafka, Apache Cassandra, Apache Apex, Apache Impala, and Greenplum. Despite leveraging these advanced technologies, LexisNexis encountered significant operational challenges, especially during peak activity periods. The growing size of data sets and an increasing number of users put a strain on their infrastructure, leading to several critical issues:

  • Data Ingestion Delays: Ingesting data took up to a minute due to small-file writes and necessary compaction.
  • Long Query Completion Times: Customers faced query times up to three minutes, significantly hindering efficiency.
  • Frequent Outages: Unpredictable outages in the data pipeline led to customer frustration.
  • Complex to Change: Implementing business process changes, such as adding new data columns, was a lengthy process, often taking weeks.

Next-Gen Database Needs for DIN:

  1. Flexible Query Capability: Facilitate customer-initiated, ad-hoc queries over a 6-month data period for datasets larger than 3 billion records without preset queries.
  2. Rapid Data Ingestion: Ingest over 5,000 rows per second, with the data being ready for querying within a minute.
  3. Wide Tables: Store a main table with 40,000 rows, 1,200 columns, and more than 1 petabyte of data.
  4. High User and Query Volume: Support over 250 users simultaneously and process more than 100,000 daily queries, keeping query response times below 50 milliseconds.

3X speed from 4X fewer nodes

By transitioning to Yellowbrick, LexisNexis achieved a significant performance boost, integrating smoothly with the existing data pipeline. End-users experienced marked improvements, with most operations completed in milliseconds. This enhancement was realized using only 15 nodes, a quarter of the previous number, and with 80% less memory than the prior solution.

Results include:

  • Improved Customer Experience: Leveraging Yellowbrick’s rapid processing, LexisNexis delivers up-to-date and in-depth insights more efficiently.
  • Minimal Management: Yellowbrick’s automated resource allocation reduces administrative needs, and no manual performance tuning is required.
  • Enhanced Customer Experience: Stability and global distribution of Yellowbrick instances mean reliable service and flexible workload management, improving overall customer satisfaction,

EXPERIENCE THE DIFFERENCE
WITH YELLOWBRICK