All Insights
Automationagentic clis7 min

Agentic CLIs Are Redefining OSINT Collection Workflows

David Greenhill
David GreenhillTechnical Lead
·

The OSINT analyst's workflow has traditionally been manual: identify a target, choose a tool, run it, interpret the output, decide the next step, repeat. Every decision point requires a human in the loop. Agentic CLI frameworks collapse that loop.

What Makes a CLI "Agentic"

A conventional CLI tool executes a command and returns output. An agentic CLI tool does something fundamentally different: it takes a high-level objective, decomposes it into tasks, selects and executes tools, evaluates results, and decides what to do next — all within a terminal session.

Frameworks like Claude Code, Goose, and Aider represent this shift. They're not wrappers around existing tools. They're autonomous agents that happen to operate in a terminal environment.

The OSINT Application

Consider a typical OSINT collection task: "Map the digital footprint of organization X." A human analyst would:

  • Search for the organization's domains
  • Run DNS enumeration on each domain
  • Check certificate transparency logs
  • Search social media platforms
  • Cross-reference findings across sources
  • Document everything with timestamps

An agentic CLI collapses this into a single directive. The agent plans the collection, executes each step, evaluates whether the results warrant follow-up, and compiles findings into structured output — all while maintaining a chain of custody log.

Tool Orchestration

The real power isn't in any single tool — it's in the orchestration layer. An agentic CLI can:

  • Select tools dynamically: Choose between whois, dig, subfinder, or amass based on what the target requires
  • Chain outputs: Feed the results of one tool into the next without human intervention
  • Handle failures gracefully: If a tool times out or returns empty results, try an alternative approach
  • Respect rate limits: Automatically throttle requests to avoid detection or blocking
  • Maintain context: Remember what's been collected and what gaps remain

Evidence Integrity in Automated Collection

Automation introduces a chain of custody challenge: if no human observed the collection, how do you attest to what happened? TCI's approach:

  • Immutable session logs: Every command, output, and decision logged to WORM storage
  • SHA-256 hashing: Each collected artifact hashed at capture time
  • RFC 3161 timestamps: Trusted timestamps on every hash
  • Decision audit trail: The agent's reasoning for each step recorded alongside the output

The automation actually improves chain of custody over manual workflows, because the logging is comprehensive and automatic rather than dependent on an analyst remembering to document every step.

Operational Considerations

Agentic collection isn't a magic bullet. Practitioners need to consider:

  • Scope control: An autonomous agent needs clear boundaries to avoid collecting outside the authorized scope
  • Attribution risk: Automated collection patterns can be fingerprinted — agents need randomization and rate control
  • Quality validation: Automated collection can produce volume without quality — human review of key findings remains essential
  • Legal compliance: The same legal frameworks that govern manual OSINT apply to automated collection

Where This Is Going

The trajectory is clear: OSINT collection is becoming an infrastructure problem rather than a human skills problem. The analyst's role shifts from "operate the tools" to "define the objectives, validate the findings, and make the judgments that require human context."

TCI is building collection pipelines around agentic CLI frameworks because the productivity gain is too significant to ignore. A single analyst with an agentic CLI can match the collection throughput of a small team.

David Greenhill

Written by

David Greenhill

Technical Lead, The Commonlight Initiative

Need help with your evidence infrastructure?

TCI builds capture pipelines, redaction workflows, and air-gapped processing systems for organizations handling sensitive data.