AI · Neutral · arXiv CS AI · 5h ago
AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding
Researchers introduced AttackSeqBench, a benchmark that evaluates how well large language models (LLMs) understand and reason about cyber attack sequences described in threat intelligence reports. The study tested 7 LLMs, 5 large reasoning models (LRMs), and 4 post-training strategies on their ability to analyze adversarial behaviors across the tactical, technical, and procedural dimensions.