AI-Powered Root Cause: Correlating File Access with APM via Dynatrace A serverless Lambda pipeline ships FSx for ONTAP audit logs to Dynatrace via the Log Ingest API v2, enabling Davis AI to automatically correlate file access anomalies with application performance degradation. The system identifies root causes such as "500 users hitting the same NFS share simultaneously" causing app slowdowns, with logs visible in the Logs Viewer within one to two minutes. Dynatrace builds a topology map of the entire stack, using time-window correlation and entity connectivity to find causal relationships between storage events and application metrics. We built a serverless Lambda pipeline that ships FSx for ONTAP audit logs to Dynatrace via the Log Ingest API v2. The real value: Dynatrace's Davis AI can automatically correlate file access anomalies with application performance degradation — answering "why is the app slow?" with "because 500 users hit the same NFS share simultaneously." FSx for ONTAP → S3 Access Point → EventBridge Scheduler → Lambda → Dynatrace Log Ingest API v2 │ ▼ Davis AI ┌───────────────────┐ │ Correlates: │ │ • File access │ │ anomalies │ │ • APM metrics │ │ • Infrastructure │ │ health │ │ │ │ → Root cause │ │ in seconds │ └───────────────────┘ Verified on Dynatrace SaaS Trial Tokyo-equivalent region . Logs visible in Logs Viewer within 1-2 minutes. This is Part 11 of the Serverless Observability for FSx for ONTAP https://dev.to/aws-builders/why-your-fsx-for-ontap-audit-logs-deserve-better-than-ec2-kod series. Most observability tools treat storage logs as isolated data. Dynatrace is different — it builds a topology map of your entire stack and uses Davis AI to find causal relationships through time-window correlation and entity connectivity: | Scenario | Without Dynatrace | With Dynatrace | |---|---|---| | App latency spike | "Check the logs" | Davis AI detects temporal correlation: file access to /vol/data/ increased 10x within the same 5-minute window as app response time degradation, connected via topology app → NFS mount → SVM | | Storage I/O anomaly | Manual investigation | Automatic correlation via shared topology entities — Davis identifies which services are affected based on entity relationships | | User reports slow file access | Grep through audit logs | DQL query + topology view showing the full dependency path from user request to storage operation | The key differentiator: Davis AI correlates events across entities that share topology connections within overlapping time windows — not just keyword matching or manual dashboard correlation. ┌─────────────────────────────────────────────────────────┐ │ Event Sources │ ├─────────────────────────────────────────────────────────┤ │ │ │ EventBridge Scheduler │ │ rate 5 minutes ──→ Lambda │ │ │ lists new files via │ │ │ S3 Access Point │ │ │ checkpoint in SSM │ │ ▼ │ │ Dynatrace Log Ingest API v2 │ │ Api-Token auth │ │ │ │ │ EMS Webhook │ │ │ ──→ API GW ──→ Lambda ─────────────┤ │ │ ems handler │ │ │ ▼ │ │ FPolicy Dynatrace │ │ ──→ ECS Fargate ──→ SQS Logs Viewer, │ │ ──→ Bridge Lambda Davis AI, │ │ ──→ EventBridge DQL, │ │ ──→ Lambda fpolicy handler Dashboards │ │ ──────────────────────────────────────────────────────┤│ └─────────────────────────────────────────────────────────┘ When you ship FSx for ONTAP logs to Dynatrace alongside your APM data, Davis AI can detect patterns like: This works because Dynatrace maps your FSx for ONTAP SVM as a custom device entity in its topology, connecting it to the applications that access it. logs.ingest dt0c01.