Does OpenAI use book data, such as ebooks or scanned PDFs?

This table (blow) from the reference above summaries nicely for those who will not have time to read the blog post: