DARKNAVY @DarkNavyOrg

Cybersecurity enthusiasts from DARKNAVY. Achieve, Analyze, Attack *Oops. darknavy.org Joined November 2023

Tweets

105
Followers

5K
Following

77
Likes

113

DARKNAVY @DarkNavyOrg

2 weeks ago

While Mythos showed what frontier model might become, we asked a different question: With a dedicated security harness, can open-source LLMs approach Mythos-level vulnerability research on real targets? Meet deepsec, DARKNAVY's attempt to answer. darknavy.org/blog/deepsec_c…

1 27 130 32K 99

View Details

DARKNAVY @DarkNavyOrg

a month ago

Thank you to @Qualcomm for the invitation! We are thrilled to be working alongside mobile vendors to contribute to the security of the ecosystem. Come chat with our team members if you’re also at the Qualcomm Security Summit!

Xg Xiao @xgxiao66

a month ago

🏙️ Awesome conference in San Diego - Qualcomm Product Security Summit. 📡 Feel free to say hi to me - I am in a DARKNAVY hoody! #qpss26

1 0 18 6K 1

1 2 34 5K 3

View Details

DARKNAVY @DarkNavyOrg

a month ago

Coding agent hacking series 3/3: Cursor. The "Auto-Run in Sandbox" mode of @cursor_ai is great: user-friendly, convenient, and supposedly safer. But just like Codex CLI, following content from a remote URL can chain vulnerabilities from prompt injection to unauthorized command execution outside the sandbox, without further user approval under this mode.

1 6 34 8K 12

View Details

DARKNAVY @DarkNavyOrg

a month ago

Coding agent hacking series 2/3: Codex CLI. It looks seriously secure: sandboxing by default, built in Rust, reviewed by top LLMs from @OpenAI. But in our latest demo, one web fetch can chain multiple vulnerabilities from prompt injection to unauthorized command execution outside the sandbox in one shot!

3 9 57 10K 36

View Details

DARKNAVY @DarkNavyOrg

a month ago

Coding agent hacking series 1/3: Claude Code. @AnthropicAI is building impressively powerful cyber models like Mythos. However, their core coding product can still stumble on security boundaries beyond prompt injection. Our demo shows how web content exploring can be chained with other vulnerabilities to bypass permission checks and execute attacker's commands without your approval ;)

3 6 48 9K 37

View Details

DARKNAVY @DarkNavyOrg

2 months ago

The original video has a “text” issue so we re-uploaded everything. Thanks to @roddux for pointing it out!(4/3)

0 0 15 3K 1

View Details

DARKNAVY @DarkNavyOrg

2 months ago

We are committed to freeing human researchers from tedious, repetitive tasks so they can focus on real innovation. Stay tuned for our upcoming release of an AI-powered, end-to-end security research platform! (3/3)

1 0 28 3K 3

View Details

DARKNAVY @DarkNavyOrg

2 months ago

We obtained root privilege on the S26 (Exynos 2600 Chipset), the latest flagship smartphone from Samsung. To our knowledge, this is the first root exploit for Exynos S26 since Samsung removed bootloader unlocking option in One UI 8. It is exploitable from APP context, so we make a cmd wrapper app for demo👇(1/n)

15 65 335 32K 131

View Details

DARKNAVY @DarkNavyOrg

2 months ago

Our AI Agent popped a root shell on Ubuntu 26.04 on the first day it was released :)

32 98 770 580K 242

View Details

Defi Nerd @Defi_Nerd_sec

3 months ago

😃Just got a bug confirmed on @immunefi that we found using our client auditor skill.

6 4 94 8K 6

View Details

Defi Nerd @Defi_Nerd_sec

3 months ago

On 2026-03-27 03:40:34 PM +UTC, the #EST token / BNBDeposit system on #BSC was exploited through a **flash-loan-assisted reward-accounting flaw** in `BNBDeposit`, amplified by **fee-exempt routing and pair-state manipulation** in EST. Based on our exploit investigation skill: github.com/DarkNavySecuri… Check threads for specific code illustration.

2 3 18 5K 21

View Details

DARKNAVY @DarkNavyOrg

3 months ago

iOS/macOS 26.4 addresses two vulnerabilities we reported before. Both were discovered by our under-development AI agentic system, which is capable of processing both binary and source code ;)

2 10 84 13K 27

View Details

Defi Nerd @Defi_Nerd_sec

3 months ago

Over the past few weeks we've been building AI-powered security skills for Web3, covering smart contract auditing, blockchain client auditing, and onchain exploit investigation. Here is the skills repo👇 github.com/DarkNavySecuri…

2 10 64 16K 67

View Details

DARKNAVY @DarkNavyOrg

3 months ago

We've just open-sourced a preview version of our agent skills for Web3 security! Enjoy your playing :)

Defi Nerd @Defi_Nerd_sec

3 months ago

These skills have helped us earn $21K on Immunefi @immunefi and independently discover a vulnerability in rippled @XRPLF @RippleXDev, the XRP Ledger's core node software, that was officially patched. Every exploit breakdown we've posted before was built with these skills.

1 1 8 5K 6

0 0 17 4K 11

View Details

DARKNAVY @DarkNavyOrg

3 months ago

Our AI agent researcher @Defi_Nerd_sec is delivering in Web3! Although this case was flagged as a duplicate, the agent independently generated a working exploit, going beyond discovery and into execution. Cases like this suggest AI-driven workflows are beginning to cover a much larger share of the exploit chain, putting pressure on the security posture of the entire industry. Glad to see it addressed! @XRPLF @RippleXDev Full credit to the original reporter as well👍

XRP Ledger Foundation @XRPLF

3 months ago

XRP Ledger Software version 3.1.2 is available. This version is fixing an edge case that can cause outages on public facing nodes. Please update your nodes as soon as possible to this new version. More details in the release notes: github.com/XRPLF/rippled/…

19 66 263 52K 10

0 2 13 5K 10

View Details

DARKNAVY @DarkNavyOrg

3 months ago

The bug being exploited was identified during our evaluation of the internal AI Agent, which automatically submit some of the findings with PoCs. Very surprised to see @osec_io take it to the another level! Also look forward to AI automatically generating such complex exploits.

OtterSec @osec_io

3 months ago

We achieved a guest-to-host escape by exploiting a QEMU 0-day where the bytes written out of bounds were uncontrolled. Full breakdown of the technique, glibc allocator behavior, and our heap spray/RIP-control primitive ↓

8 112 549 49K 280

0 3 25 7K 9

View Details

DARKNAVY @DarkNavyOrg

3 months ago

Hi @thezdi @OpenAI, asking for the rules of Pwn2Own26 Coding Agent directory, particularly the "interact with ... repository" If a user opens someone else's git repo using CodeX App with default permissions and is immediately RCE’d, does this fall within the threat model? :)