T1119 Sumo Logic CSE · Sumo

Detect Automated Collection in Sumo Logic CSE

Once established within a system or network, an adversary may use automated techniques for collecting internal data. Methods for performing this technique could include use of a Command and Scripting Interpreter to search for and copy information fitting set criteria such as file type, location, or name at specific time intervals. In cloud-based environments, adversaries may also use cloud APIs, data pipelines, command line interfaces, or ETL services to automatically collect data. This functionality could also be built into remote access tools. This technique may incorporate use of other techniques such as File and Directory Discovery and Lateral Tool Transfer to identify and move files, as well as Cloud Service Dashboard and Cloud Storage Object Discovery to identify resources in cloud environments.

MITRE ATT&CK

Tactic
Collection
Technique
T1119 Automated Collection
Canonical reference
https://attack.mitre.org/techniques/T1119/

Sumo Detection Query

Sumo Logic CSE (Sumo)
sql
_sourceCategory="windows/sysmon" OR _sourceCategory="os/windows/security"
| where EventID = "1" OR EventID = "4688"
| parse field=CommandLine "*" as cmdline nodrop
| parse field=Image "*" as process_image nodrop
| eval cmdline_lower = toLowerCase(cmdline)
| eval proc_lower = toLowerCase(process_image)
// Flag recursive file search patterns
| eval RecursiveSearch = if (
    cmdline_lower matches "*(get-childitem|gci |-recurse|dir /s|forfiles|os\.walk|glob\.glob|scandir)*",
    1, 0
  )
// Flag sensitive file extensions
| eval SensitiveExt = if (
    cmdline_lower matches "*(\.docx?|\.xlsx?|\.pdf|\.pptx?|\.pst|\.ost|\.mdb|\.accdb|\.kdbx|\.pfx|\.pem|\.p12|\.key|\.csv)*",
    1, 0
  )
// Flag archive tools
| eval ArchiveTool = if (
    proc_lower matches "*(rar\.exe|winrar\.exe|7z\.exe)*",
    1, 0
  )
// Flag mass copy operations
| eval MassCopy = if (
    proc_lower matches "*(robocopy\.exe|xcopy\.exe)*"
    and cmdline_lower matches "*(/s|/e|/mir)*",
    1, 0
  )
// Flag script interpreters
| eval ScriptInterpreter = if (
    proc_lower matches "*(powershell\.exe|pwsh\.exe|python[23]?\.exe|wscript\.exe|cscript\.exe)*",
    1, 0
  )
// Flag credential-adjacent file types
| eval CredentialFiles = if (
    cmdline_lower matches "*(\.pfx|\.pem|\.p12|\.key|\.kdbx)*",
    1, 0
  )
// Composite suspicion score
| eval SuspicionScore = RecursiveSearch + SensitiveExt + ArchiveTool + MassCopy + CredentialFiles
// Apply detection logic
| where (ScriptInterpreter = 1 and RecursiveSearch = 1 and SensitiveExt = 1)
    or MassCopy = 1
    or (ArchiveTool = 1 and SensitiveExt = 1)
    or SuspicionScore >= 2
| fields _messageTime, Computer, User, Image, CommandLine, ParentImage, ParentCommandLine,
         RecursiveSearch, SensitiveExt, ArchiveTool, MassCopy, CredentialFiles, SuspicionScore
| sort by SuspicionScore desc, _messageTime desc
high severity high confidence

Detects automated data collection (T1119) in Sumo Logic by parsing Sysmon Event ID 1 and Security Event ID 4688 process creation logs. Uses a suspicion scoring model to identify recursive enumeration via PowerShell/CMD/Python, mass copy operations, and document archiving with sensitive file type targeting.

Data Sources

Sysmon via Sumo Logic Windows CollectorWindows Security Event Log via Sumo Logic Collector

Required Tables

Sumo Logic _sourceCategory windows/sysmonSumo Logic _sourceCategory os/windows/security

False Positives & Tuning

  • Automated backup jobs using PowerShell scripts with Get-ChildItem and robocopy to copy document libraries to network shares
  • Enterprise DLP or CASB agents performing scheduled file classification scans that enumerate by extension across local drives
  • Developer build pipelines or CI scripts that use Python os.walk or glob to collect artifacts for packaging
Download portable Sigma rule (.yml)

Other platforms for T1119


Testing Methodology

Validate this detection against 4 adversary techniques from Atomic Red Team. Each test below lists the behaviour to exercise and the telemetry you should expect to see. Executable commands and cleanup steps are available with Pro.

  1. Test 1PowerShell Recursive Document Collection to Staging Directory

    Expected signal: Sysmon Event ID 1: Process Create with Image=powershell.exe, CommandLine containing 'Get-ChildItem', '-Recurse', '.docx', '.xlsx', '.pdf'. Sysmon Event ID 11: Multiple file creation events in %TEMP%\df00tech-stage for each copied file. PowerShell ScriptBlock Log Event ID 4104 with the full collection script. Security Event ID 4663 (if SACL auditing enabled) for each source document read.

  2. Test 2CMD Recursive File Enumeration with dir /s

    Expected signal: Sysmon Event ID 1: Process Create with Image=cmd.exe, CommandLine containing 'dir /s /b' and '.docx', '.xlsx', '.pdf'. Sysmon Event ID 11: File creation event for %TEMP%\df00tech-filelist.txt. Security Event ID 4688 (if process creation auditing and command line logging are enabled) with full command line including extension targets.

  3. Test 3forfiles Automated Document Enumeration

    Expected signal: Sysmon Event ID 1: Process Create for the shell executing forfiles with CommandLine containing 'forfiles', '/S', and '.docx'. Child cmd.exe process creation events as forfiles spawns a cmd.exe instance per matching file. Sysmon Event ID 11: File creation for %TEMP%\df00tech-forfiles.txt. The child cmd.exe processes with 'echo @PATH' are also logged individually.

  4. Test 47-Zip Archive Collection — Document Staging Pre-Exfiltration

    Expected signal: Sysmon Event ID 1: Process Create with Image=7z.exe, CommandLine containing 'a' (add to archive), target paths with '.docx', '.xlsx', '.pdf', '-r' (recursive), and '-p' (password). Sysmon Event ID 11: File creation event for %TEMP%\df00tech-archive.7z. Security Event ID 4663 (if auditing enabled) for each source document file opened by 7z.exe during archiving.

Unlock Pro Content

Get the full detection package for T1119 including response playbook, investigation guide, and atomic red team tests.

Response PlaybookInvestigation GuideHunting QueriesAtomic Red Team TestsTuning Guidance

Related Detections