(1) A text file that contains data identified with an embedded tag. See
tagged text and
XML.
(2) A multimedia file that contains a description of its content. See
metadata.
(3) Data identified for training AI systems. An image may be labeled if its file name is descriptive of the content. However, the vast majority of data traversing the Internet is not identified in such a manner and unlabeled data is often labeled manually (see
AI data labeling). However, data can also be labeled by machine to provide greater amounts of input to train other language models. See
large language model.