Swimming in the AI Data Lake: Why Disclosure and Versions of Record Are More Important than Ever