High-fidelity capture of Twitter threads as sealed PDFs.

Request a capture. The form asks for the URL to capture and a reason for archiving, and offers an "Unfold thread" option and a Capture button. Submitting this form will open a new tab, in which your request will be processed. The capture process should take around a minute, at the end of which the resulting sealed PDF will be ready to be downloaded.

This site is an experiment by the Harvard Library Innovation Lab to let you download signed PDFs of Twitter URLs. Here's an example PDF we made from this tweet.

Why make a PDF archiving tool for Twitter?
There are lots of screenshots of Twitter threads going around. You can't tell who made them, or when they were made. PDFs let us apply document signatures and timestamps so anyone can check, in the future, that a PDF you download with this site really came from the Harvard Library Innovation Lab and hasn't been edited. PDFs also let us bundle in additional media as attachments. Each signed PDF currently includes all images in the page (so you can see full-size images that are cropped in the PDF view), the primary video on the page if any, and a list of all the t.co links in the thread with their actual destinations.

Why not make a PDF archiving tool for Twitter?
On Twitter, all kinds of conversations happen at different levels of privacy in the same public space. Not everything on Twitter wants to be archived! Some tweets want to be quiet; some want to be forgotten; some are by public figures, or have public impact or sentimental value, and want to be kept around. Please think carefully about what you choose to preserve. Library nerd note: societies create much more data than they can save. "Thinking carefully about what you choose to preserve" is part of the practice of archiving. By doing it, you're helping to form our shared cultural memory.

Why do you ask the reason for archiving?
At the Library Innovation Lab, we build experiments like this to explore what's most important to save in the cultural record and how we can save it. Your answer will help us understand whether this tool is serving its purpose, who it's helping, and what other features it might need. Feel free to provide as much or as little detail as you want about who you are and what you're trying to accomplish. Including the same answer each time is fine.

How do you make these PDFs (and why does it take so long)?
Twitter captures are made using open source web archiving software we're developing at the Library Innovation Lab for eventual use in our project. The software uses a headless Chrome browser to render the page as it would appear to a reader. For this experiment, we're also running custom JavaScript in the headless browser to remove Twitter UI and make the content easier to read. Captures can take as long as a minute, because we scroll to load resources from the entire Twitter thread.

Why didn't my requested URL capture correctly?
If a page fails to capture correctly after a few attempts, let us know. Deactivating the "unfold thread" option might also give better results in certain cases. Also note that some PDF viewers will truncate very large PDFs, so you may need to try a different viewer if the top and bottom are hidden.

Does a signature on a PDF web archive mean it's real?
Library folks like to talk about "authenticity" and "provenance". A signature on a PDF tells you its provenance: you can prove that you really got the PDF from us, and that we couldn't have created it after a certain date.
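
As a rough illustration of why a signed PDF can show that it "hasn't been edited": a document signature covers a cryptographic digest of the file's bytes, so changing even one byte produces a different digest. The sketch below shows only that digest comparison, using Python's standard `hashlib`. It is not the Library Innovation Lab's signing pipeline, and it leaves out the parts a real check also needs (validating the signer's certificate and the timestamp). The function names and the sample bytes are hypothetical.

```python
import hashlib
import hmac


def pdf_digest(pdf_bytes: bytes) -> str:
    """Return the SHA-256 hex digest of the given bytes."""
    return hashlib.sha256(pdf_bytes).hexdigest()


def matches_recorded_digest(pdf_bytes: bytes, recorded_digest: str) -> bool:
    """True if the bytes still hash to the digest recorded at signing time."""
    # compare_digest does a constant-time comparison of the two hex strings.
    return hmac.compare_digest(pdf_digest(pdf_bytes), recorded_digest)


# Hypothetical stand-in for the bytes of a captured PDF.
original = b"%PDF-1.7 hypothetical capture of a thread"
recorded = pdf_digest(original)

# A one-word edit made after the digest was recorded.
edited = original.replace(b"thread", b"threat")

print(matches_recorded_digest(original, recorded))  # True
print(matches_recorded_digest(edited, recorded))    # False
```

A matching digest only says the bytes are unchanged since signing. Who signed them, and when, is what the certificate and timestamp in the signature establish.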