Splunk : Yes, Virginia, There is a -Santa Claus- Way to Detect Unemployment Fraud

January 12, 2021 at 03:24 am IST

By Andrew Morris January 11, 2021

Fraud rates for Unemployment Insurance Benefits (UIB) and Pandemic Unemployment Assistance (PUA) are out of control. In May 2020, Brian Krebs of Krebsonsecurity published two articles detailing fraud that was occurring in several different state's UIB portals. These states had been warned by the US Secret Service to be on the lookout for this. If you read the articles, the common theme is that many states are missing rudimentary controls for combating fraud. PUA simply exacerbated the fraud problem by offering money to those who formerly didn't qualify for UIB, giving the criminals even more opportunities to make money illegally.

This is not a new topic. A fellow Splunker, Chris Perkins took a detailed look at PUA Fraud3 in this post on LinkedIn, Pandemic Unemployment Assistance Fraud, back in August. It is now January 2021 and, if you aren't watching the news, the States and their constituents continue to be victims of massive fraud attacks.

Even though many states may be behind in their fraud detection techniques, UIB fraud looks very similar to banking fraud. We can take some cues from bank fraud detection and prevent much of this fraud.

Fraud detection, like security, should take a layered approach. We don't skip a layer because it is easy to bypass, as the simple layers stop a lot of problems. Glass doors with locks on them, do keep out many bad guys, even though a rock would effectively nullify that locked door.

Also, by slowing down the bad guy (find a rock or move on) and forcing them to adapt, we slow the rate of attack, allowing our analysts to do more.

Detecting Unemployment Fraud in Splunk

Let's look at some simple searches Splunk Enterprise can do with the data you have to detect fraud; Splunk makes this easy with built-in time series limits and counting operations. Here we will limit our search to a 1 day time span, and look for multiple usernames coming from the same IP address using the dc (distinct count) command:

(NOTE: All data is fictional. Any resemblance to actual persons or ip addresses, living or dead, or actual events is purely coincidental.)


| inputlookup login-data3
| bin span=1d _time |  stats dc(username) as 'unique usernames' by ip_address, _time
| sort by 'unique usernames'
| table 'unique usernames', ip_address
| reverse

Splunk allows us to easily click on data in our reports and drill down - by clicking on the IP address at the top of the list, I can easily see all the different usernames associated.

I would implement this as a scheduled search, and have it ready as a report for analysts to review every morning. I would refine it over time to ignore IP addresses that are known to have multiple users (libraries for example) behind a single IP.

We know that IP address is not always a good metric as often IP addresses are reused or shared, so let's use device identification as another layer of fraud detection. If you are not familiar with 'Device ID', checkout commercial offerings by iovation4 (now a TransUnion company) or open source solutions using clientjs5 or fingerprint.js6. All of these offer mechanisms for identifying a device (computer, phone, tablet) that visits your site, regardless of it's IP address.

We use similar search commands, looking for any device ID with more than 2 users associated with it:

Drilling down on the deviceID we see that the IP addresses have changed. Searching on a single IP address may not have discovered all related accounts.

We also see that most logins have failed, but one did succeed. The successful login should be scrutinized more closely as this is an account that actually gained access. Although the above searches are looking at logins, we could easily use IP address or Device ID to catch fraudsters applying for benefits with lists of identities.

IP Geolocation is another useful tool; it shows how a physical location corresponds to an IP address. In the current climate, not everyone is living where we expect, but IP geolocation can show us hot spots of activity we would not expect. Splunk has IP geolocation built-in and also includes the free version of a popular geolocation database (customers can always install their own version). Our search takes some log data with IP addresses; the last 2 lines of code bring the magic. Here we limit ourselves to only IP addresses in the USA, and we map those to US cities:


| inputlookup login-data2
| eval _time=strptime(_time.'-0700','%m/%d/%y %H:%M%Z')
| iplocation ip_address| where Country = 'United States'
| iplocation ip_address  | geostats latfield=lat longfield=lon count

In looking at the chart, the State of Ohio may expect people to be applying from elsewhere, but why the large bubble in Washington, DC? Hovering and drilldown are easy to do for more information . Clicking on the dot for WDC, we have more details on all activity from that area:

Finally, an expanding area for Splunk that our customers are discovering, is link analysis. There are many ways to do link analysis in Splunk, but here is one example that is currently being used for UIB investigations.

Notice our user 'azhatley' at the center. They are connected to others in multiple ways: shared phone number for one group, shared IP address for another group, and shared destination bank account with others. There are even 2 more smaller groupings on the right side.

I go into more details on link analysis in my .conf20 presentation 'Fraud, the Missing Link,' and I will be creating more in-depth blog posts that dive deeper using Splunk for link analysis in the coming weeks.

This is just the starting point when using Splunk for fraud detection. Most customers grow these searches organically finding more fields to build searches on or combining the output of these searches to create higher fidelity alerts. They then layer in enrichment data (Splunk can make API calls to 3rd party services), machine learning (ML), that is free with Splunk, to detect fraud via supervised ML, or detect outliers or odd clusters via unsupervised ML; and then add automation via Splunk Phantom to let the humans focus on the investigation instead of the mundane repetitive tasks.

Check out the below resources to learn more:

https://krebsonsecurity.com/2020/05/u-s-secret-service-massive-fraud-against-state-unemployment-insurance-programs/
https://krebsonsecurity.com/2020/05/riding-the-state-unemployment-fraud-wave/
https://www.linkedin.com/pulse/pandemic-unemployment-assistance-fraud-chris-perkins/
https://www.iovation.com/
https://clientjs.org/
https://github.com/fingerprintjs/fingerprintjs
https://conf.splunk.com/watch/conf-online.html?search=SEC1164C - /

Attachments

Original document
Permalink

Disclaimer

Splunk Inc. published this content on 11 January 2021 and is solely responsible for the information contained therein. Distributed by Public, unedited and unaltered, on 11 January 2021 21:53:02 UTC

	1st Jan change	Capi.
MICROSOFT CORPORATION	+18.86%	3,322B
SYNOPSYS INC.	+15.57%	91.17B
CADENCE DESIGN SYSTEMS, INC.	+12.99%	84.37B
PALANTIR TECHNOLOGIES INC.	+47.52%	56.41B
DASSAULT SYSTÈMES SE	-20.20%	49.69B
THE TRADE DESK, INC.	+35.77%	47.77B
ATLASSIAN CORPORATION	-25.64%	46.04B
SEA LIMITED	+76.35%	41.02B
TAKE-TWO INTERACTIVE SOFTWARE, INC.	-3.39%	27.21B

1st Jan change

Capi.

MICROSOFT CORPORATION

+18.86%

3,322B

SYNOPSYS INC.

+15.57%

91.17B

CADENCE DESIGN SYSTEMS, INC.

+12.99%

84.37B

PALANTIR TECHNOLOGIES INC.

+47.52%

56.41B

DASSAULT SYSTÈMES SE

-20.20%

49.69B

THE TRADE DESK, INC.

+35.77%

47.77B

ATLASSIAN CORPORATION

-25.64%

46.04B

SEA LIMITED

+76.35%

41.02B

TAKE-TWO INTERACTIVE SOFTWARE, INC.

-3.39%

27.21B

Splunk Inc. Introduces New Security Innovations to Power the SOC of the Future	12/06	CI
Splunk Unveils Next-Generation Data Management Experience At the Edge and Beyond	12/06	CI
Splunk Inc. Introduces Advanced AI Enhancements for Observability, Security and IT Service Intelligence	12/06	CI
Cisco and Splunk Announce Integrated Full-Stack Observability Experience for the Enterprise	05/06	CI
Bitwarden Expands Splunk Cloud Integration for Advanced Event Management	16/05	CI
Splunk Unveils Asset and Risk Intelligence to Revolutionize Proactive Risk Mitigation	06/05	CI
ANALYST RECOMMENDATIONS : Best Buy, Wells Fargo, AMD, Netflix, Nvidia...	20/03
Splunk Inc.(NasdaqGM:SPLK) dropped from FTSE All-World Index	20/03	CI
Splunk Inc.(NasdaqGS:SPLK) dropped from S&P Software & Services Select Industry Index	20/03	CI
Splunk Inc.(NasdaqGS:SPLK) dropped from S&P TMI Index	20/03	CI
Splunk Inc.(NasdaqGS:SPLK) dropped from S&P Global BMI Index	20/03	CI
ANALYST RECOMMENDATIONS : 3M Company, Snowflake, Splunk, Micron, Nvidia...	19/03
How Cisco Will Integrate Splunk Into Company	18/03	MT
Cisco: completes acquisition of Splunk for $28 billion	18/03	CF
Splunk Inc.(NasdaqGS:SPLK) dropped from NASDAQ Composite Index	18/03	CI
Cisco Systems, Inc. completed the acquisition of Splunk Inc. from Hellman & Friedman Capital Partners X, L.P., managed by Hellman & Friedman LLC, BlackRock, Inc., The Vanguard Group, Inc., PRIMECAP Management Company and others for approximately $27 billion..	18/03	CI
Splunk Inc.(NasdaqGS:SPLK) dropped from NASDAQ-100 Index	15/03	CI
Add a little SaaS to your life	14/03
EU Watchdog Green-lights Cisco Systems' Purchase of Splunk	14/03	MT
Cisco gains EU antitrust nod for $28 billion Splunk acquisition	14/03	RE
Oracle posts rise in quarterly profit on strong cloud demand	12/03	RE
Linde to Join Nasdaq-100 Index	11/03	MT
Cisco's Splunk deal set to win unconditional EU antitrust OK, sources say	05/03	RE
GitLab shares drop as 'less conservative' forecast disappoints investors	05/03	RE
Splunk beats quarterly revenue estimates on steady demand for cloud services	28/02	RE

Splunk Inc.

Equities

SPLK

US8486371045

Software

Splunk : Yes, Virginia, There is a -Santa Claus- Way to Detect Unemployment Fraud

Latest news about Splunk Inc.

Chart Splunk Inc.

Company Profile

Sector Other Software