Section 1
1. Short title
This Act may be cited as the Generative AI Copyright Disclosure Act of 2024.
Section 2
2. Notice to be submitted to the Register of Copyrights with respect to copyrighted works used in building generative AI systems
(a)(1) A person who creates a training dataset, or alters a training dataset (including by making an update to, refining, or retraining the dataset) in a significant manner, that is used in building a generative AI system shall submit to the Register a notice that contains—
    (A) a sufficiently detailed summary of any copyrighted works used—
        (i) in the training dataset (in the case that the person creates the dataset); or
        (ii) to alter the training dataset (in the case that the person alters the training data in a significant manner); and
    (B) the URL for such dataset (in the case of a training dataset that is publicly available on the internet at the time the notice is submitted).
  (2) The notice required by paragraph (1) shall be submitted—
    (A) not later than 30 days before the generative AI system with respect to which the training dataset is used is made available to consumers, in the case that the generative AI system is first made available to consumers after the date on which this Act takes effect; and
    (B) not later than 30 days after the date on which this Act takes effect, in the case that the generative AI system with respect to which the training dataset was used was made available to consumers before the effective date of this Act.
(b) Any person described under paragraph (1) of subsection (a) that fails to comply with a requirement under such subsection shall be assessed a civil penalty in an amount not less than $5,000.
(c) Not later than 180 days after the date on which this Act takes effect, the Register shall issue regulations to implement the requirement under paragraph (1).
(d) The Register shall establish and maintain a publicly available online database that contains each notice filed under subsection (a)(1).
(e) In this section:
  (1) The term Artificial Intelligence means an automated system designed to perform a task typically associated with human intelligence or cognitive function.
  (2) The term copyrighted work means a work protected in the United States under a law relating to copyrights.
  (3) The term generative AI model means a combination of computer code and numerical values designed to use Artificial Intelligence to generate outputs in the form of expressive material such as text, images, audio, or video.
  (4) The term generative AI system means a software product or service that—
    (A) substantially incorporates one or more generative AI models; and
    (B) is designed for use by consumers.
  (5) The term Register means the Register of Copyrights.
  (6) The term training dataset means a collection of individual units of material (including a combination of text, images, audio, or other categories of expressive material, as well as annotations describing the material) used to train a generative AI model.
(f) This Act shall take effect on the date that is 180 days after the date of the enactment of this Act.
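Illustrative note, not part of the bill text: the two notice deadlines and the effective date above reduce to simple date arithmetic. The Python sketch below is one possible reading, not an authoritative interpretation. The function names `effective_date` and `notice_deadline` and any enactment date passed to them are hypothetical. The text does not address a system first released exactly on the effective date, so the sketch assumes such a system falls under the "30 days after" rule.

```python
from datetime import date, timedelta

EFFECTIVE_DELAY_DAYS = 180  # Act takes effect 180 days after enactment
NOTICE_WINDOW_DAYS = 30     # both deadline rules use a 30-day window


def effective_date(enactment: date) -> date:
    """Date the Act takes effect, given a (hypothetical) enactment date."""
    return enactment + timedelta(days=EFFECTIVE_DELAY_DAYS)


def notice_deadline(enactment: date, release: date) -> date:
    """Latest date to submit the notice for a system released on `release`.

    Released after the effective date: 30 days before release.
    Released earlier: 30 days after the effective date.
    Assumption: a release exactly on the effective date is treated
    like an earlier release, since the text does not cover that case.
    """
    effective = effective_date(enactment)
    if release > effective:
        return release - timedelta(days=NOTICE_WINDOW_DAYS)
    return effective + timedelta(days=NOTICE_WINDOW_DAYS)
```

For example, calling `notice_deadline(date(2025, 1, 1), date(2025, 12, 1))` applies the "30 days before release" rule, while a release date earlier than the computed effective date applies the "30 days after the effective date" rule.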