Information security, data science and cloud computing skills are the most sought-after talents in the marketplace today. Security operations center (SOC) resources — typically analysts and threat hunters — are increasingly needed to combat the growing threat of adversaries launching aggressive campaigns with the latest techniques and technologies.

The World of the Security Data Scientist

While there are several products to identify, detect and contain known threats and any indicator of compromise (IOC), there is very little protection against unknown threats, zero-day exploits and newly identified vulnerabilities. With the explosion of enriched security log data from thousands of servers, devices, databases and applications, managing this highly complex puddle of structured and unstructured data is a humongous task.

Enter the security data scientist.

What Is a Security Data Scientist?

Security data scientists are practitioners with a solid domain knowledge on network security, identity and access management (IAM) and vulnerability management. However, their core expertise lies in the deep conceptual understanding of advanced mathematics and statistical concepts. These include linear algebra, differential equations, probability distributions, quantitative methods and inferential statistics.

Security data scientists have the skills to understand complex algorithms and build advanced models, applying these concepts to real security data sets in single or clustered environments. They are experts in computer programming languages like Python, R, Scala or MATLAB.

They are also deft at using big data technologies, such as Hadoop Distributed File System (HDFS), Elasticsearch, MapReduce and Apache Spark, to architect enterprise-level security data lake solutions. They also have the business knowledge to present complex data visualizations describing data relationships, such as key performance indicators (KPIs), metrics and scorecards, to senior business executives.

Analytics Services

Security organizations need data scientists to organize, aggregate, enrich and transform huge volume of security data sets into meaningful schema and models. They need to understand underlying data relationships using descriptive analytics, such as correlation heat maps, cause and effect diagrams, time series and frequency charts. Once the data is transformed, cleaned and persisted in a structured format, the data scientist can train the machine to learn from labeled historical data sets and predict outcomes using supervised machine learning. They can also detect patterns and classes in unlabeled data using unsupervised techniques, such as clustering, dimensionality reduction and anomaly detection.

False positive classification, pattern analytics, model scoring, topic modeling and rule analytics are other use cases where machine learning and predictive analytics can provide huge benefits to companies. Such projects can help simplify workflow, automate repetitive manual functions and discover new insights and data patterns.

A few organizations today are also employing junior data scientists and data analysts for building security dashboards and simulation models for analyzing, monitoring and reporting using business intelligence tools. As security organizations integrate with mainstream business, security data science will evolve — providing analytics services to other groups, such as fraud analytics, risk analytics, behavior analytics and disaster recovery.

Security analysts today are heads-down on real-time streaming events, IOCs and intelligence feeds. They have little bandwidth to research unknown threats or identify historical data anomalies.

A security data scientist has the skills and training to perform these advanced analytics tasks on data at rest and in motion — supporting analysts and providing deep insights to the chief information security officer (CISO) and the business. If you have taken the time to bake the cake, make sure to add the icing.

More from Intelligence & Analytics

X-Force Threat Intelligence Index 2024 reveals stolen credentials as top risk, with AI attacks on the horizon

4 min read - Every year, IBM X-Force analysts assess the data collected across all our security disciplines to create the IBM X-Force Threat Intelligence Index, our annual report that plots changes in the cyber threat landscape to reveal trends and help clients proactively put security measures in place. Among the many noteworthy findings in the 2024 edition of the X-Force report, three major trends stand out that we’re advising security professionals and CISOs to observe: A sharp increase in abuse of valid accounts…

Web injections are back on the rise: 40+ banks affected by new malware campaign

8 min read - Web injections, a favored technique employed by various banking trojans, have been a persistent threat in the realm of cyberattacks. These malicious injections enable cyber criminals to manipulate data exchanges between users and web browsers, potentially compromising sensitive information. In March 2023, security researchers at IBM Security Trusteer uncovered a new malware campaign using JavaScript web injections. This new campaign is widespread and particularly evasive, with historical indicators of compromise (IOCs) suggesting a possible connection to DanaBot — although we…

Accelerating security outcomes with a cloud-native SIEM

5 min read - As organizations modernize their IT infrastructure and increase adoption of cloud services, security teams face new challenges in terms of staffing, budgets and technologies. To keep pace, security programs must evolve to secure modern IT environments against fast-evolving threats with constrained resources. This will require rethinking traditional security strategies and focusing investments on capabilities like cloud security, AI-powered defense and skills development. The path forward calls on security teams to be agile, innovative and strategic amidst the changes in technology…

Topic updates

Get email updates and stay ahead of the latest threats to the security landscape, thought leadership and research.
Subscribe today