Dataset for Training Large Language Models Found to Contain Active Secrets

3 Mar 2025

CybersecurityDataLeaksSecureCodingAuthentication

A dataset used to train large language models (LLMs) has been discovered to contain nearly 12,000 active secrets, enabling successful authentication. This discovery highlights the security risks posed by hardcoded credentials, which affect both users and organizations. Additionally, it exacerbates the issue when LLMs suggest non-secure coding practices to their users.

Read the original article on thehackernews.com