Archiving: Difference between revisions

From Soyjak Wiki, The Free Soycyclopedia
Jump to navigationJump to search
(Added video tutorial. That took me longer than expected to make. The formatting is a bit screwed, so if someone could help me out that'd be great.)
Tag: Made through Tor
(35 intermediate revisions by 19 users not shown)
Line 1: Line 1:
This article is meant to be about how an average 'teen can archive gems because nothing on the internet is permanent!
{{Gem}}
 
This article is about how an average 'teen can archive gems because nothing on the internet is permanent, and less so on the sharty!
 
== Public archive sites ==
You can submit a URL to these sites and view it later.
* https://archive.today
* https://ghostarchive.org (tranny archive{{cn}})
* <s>https://web.archive.org/save</s> 'chive niggers excluded the 'sharty from the Wayback Machine. You can still link to individual media on this 'chive by uploading it as an individual video file, {{glow|but this requires you to data-mine yourself and make an account}}.
 
If you want to be handheld see: [[Archiving/archive.today tutorial]]
 
'''DO NOT''' exclusively use a single 'chive to save webpages. It’s recommended to ‘chive pages via multiple means, otherwise if an archive suddenly disappears or deletes our pages like archive.org did, your archive doesn't become [[lost media]]. Two citations don't hurt.
 
'''<big><big><big>DO NOT USE FUCKING ARCHIVE.ORG THEY HAVE ALREADY WIPED THOUSANDS OF PAGES AFTER A SINGLE SNITCHTROON EMAILED THEM</big></big></big>'''


== Local archiving ==
== Local archiving ==
Save stuff (e.g. images, videos, and even entire webpages) to your local hard drive, preferably several actually. You can use special software for this part.
Truly nothing on the internet is permanent, not even archive sites. That's why it's important to save stuff (e.g. images, videos, and even entire webpages) to your local hard drive and to preferably make backups. You can use special software for this part.


* Ctrl+S on your browser - Ugly
* Ctrl+S on your browser - Works fine on [[Vichan]], however may break if the filename is too large.
* Singlefile extension ([https://chrome.google.com/extensions/detail/mpiodijhokgodhhofbcjdecpffjipkle Chrome]/[https://addons.mozilla.org/firefox/addon/single-file Firefox])  - Good overall. Saves the page as you see it in one .html file. Saves full images if you open them.  
* Singlefile extension ([https://chrome.google.com/extensions/detail/mpiodijhokgodhhofbcjdecpffjipkle Chrome]/[https://addons.mozilla.org/firefox/addon/single-file Firefox])  - Good overall. Saves the page as you see it in one .html file. Saves full images if you open them.  
* Another option: {{quote|>he doesn't have a custom-programmed live sharty-compatible vichan archiver}} {{quote|>[[oldtroon|shiggy]] saving individual threads}}
=== Advanced ===
* [https://archivebox.io/ ArchiveBox] allows you to run your own website archiver locally (can also apparently import from browser bookmarks or history).
* [https://github.com/yt-dlp/yt-dlp yt-dlp] - Download videos from YouTube and many other sites.
* [https://github.com/bbepis/Hayden Hayden] - Newgen imageboard archiving software.
* {{quote|>he doesn't have a custom-programmed live sharty-compatible vichan archiver}} {{quote|>[[oldfag|shiggy]] saving individual threads}}
 
== Lost archives ==
There have been '''three''' separate incidents in which where the sharty lost access to archives.
 
The first was when the [[logwarehouse]] shut down on June 5, 2023, which caused a massive loss of images from late November 2022 to early June 2023. Though the images of the Log Warehouse were lost, the textual content has been [https://archive.org/details/logwarehouse archived] and is available as a torrent.


== Archive sites ==
The second time was when the sharty was excluded from the Wayback Machine on December 13th after they were snitched on, unleashing a catastrophic loss of 10,000+ archives, especially ones from the early days of the site. Most of these were only archived on Wayback Machine and are now permanently lost.
DO NOT exclusively use archive.org to save webpages. Uploading files there seems to be okay THOUGH. Use archive.ph for saving pages instead


* https://web.archive.org/save
The third time was when [[#Shinny archive|archive.soyjak.in]] went down, but you'd expect we wisened up by the third time it happened.
* https://archive.today
* https://ghostarchive.org (tranny archive)


===Archive.today/Archive.ph===
==/chive/==
If you're fed up with the snail speed of the Internet Archive, or wary of using it since 4chan was blacklisted on it<ref>https://web.archive.org/web/20230000000000*/4chan.org</ref> then you WILL opt for [https://archive.today Archive.today].
Particularly important threads, like Q&As, are also sometimes archived by moderators on-site on /chive/.


====Simple archive====
==Shinny archive==
To do a simple archive, simply paste the thread link into the field that reads "My url is alive and I want to archive its content" and then press the button that says "save". This alone is okay for things like history documentation, especially if a thread is looooong (like [[Giggly Goonclown#The Thread|this one]]).
{{Over}}
[https://archive.soyjak.in/ archive.soyjak.in]<sup>†</sup> was a replacement for now defunct [[Afterparty|Log Warehouse]] based on Hayden. <s>While it is fast and simple, it unfortunately lacks any sort of search features. It is also recommended that you archive particularly important threads manually.</s>


====Archiving media====
==4chan archives==
If you [[Science Lover Soyjak | HATE being stuck with thumbnails]], then you can open each original image as it appears in the thread in a new tab and archive each individual link it the same manner you would a thread. You can also archive videos in the same way, but they will NOT embed on Archive.today and you WILL have to download them onto your device to play them after the thread dies.
*[https://desuarchive.org desuarchive]
*[https://arch.b4k.co arch.b4k.co]
*[https://archive.4plebs.org 4plebs]
*[https://archived.moe archived.moe] - has a lot of boards but is slow and has haram ads
*[https://old.sage.moe old.sage.moe] - mid-late 2000s 4chan text archive


====Video tutorial====
==See also==
If you found the text tutorial hard to follow, here's a video tutorial. [[File:Soyarchivetutorial480p.mp4|thumb|left]]
*[[SoyBooru/Scraping]]
*[[Booru archive]]


{{reflist}}
{{reflist}}

Revision as of 14:54, 8 April 2024

This page is a gem.


This article is about how an average 'teen can archive gems because nothing on the internet is permanent, and less so on the sharty!

Public archive sites

You can submit a URL to these sites and view it later.

If you want to be handheld see: Archiving/archive.today tutorial

DO NOT exclusively use a single 'chive to save webpages. It’s recommended to ‘chive pages via multiple means, otherwise if an archive suddenly disappears or deletes our pages like archive.org did, your archive doesn't become lost media. Two citations don't hurt.

DO NOT USE FUCKING ARCHIVE.ORG THEY HAVE ALREADY WIPED THOUSANDS OF PAGES AFTER A SINGLE SNITCHTROON EMAILED THEM

Local archiving

Truly nothing on the internet is permanent, not even archive sites. That's why it's important to save stuff (e.g. images, videos, and even entire webpages) to your local hard drive and to preferably make backups. You can use special software for this part.

  • Ctrl+S on your browser - Works fine on Vichan, however may break if the filename is too large.
  • Singlefile extension (Chrome/Firefox) - Good overall. Saves the page as you see it in one .html file. Saves full images if you open them.

Advanced

  • ArchiveBox allows you to run your own website archiver locally (can also apparently import from browser bookmarks or history).
  • yt-dlp - Download videos from YouTube and many other sites.
  • Hayden - Newgen imageboard archiving software.
  • >he doesn't have a custom-programmed live sharty-compatible vichan archiver >shiggy saving individual threads

Lost archives

There have been three separate incidents in which where the sharty lost access to archives.

The first was when the logwarehouse shut down on June 5, 2023, which caused a massive loss of images from late November 2022 to early June 2023. Though the images of the Log Warehouse were lost, the textual content has been archived and is available as a torrent.

The second time was when the sharty was excluded from the Wayback Machine on December 13th after they were snitched on, unleashing a catastrophic loss of 10,000+ archives, especially ones from the early days of the site. Most of these were only archived on Wayback Machine and are now permanently lost.

The third time was when archive.soyjak.in went down, but you'd expect we wisened up by the third time it happened.

/chive/

Particularly important threads, like Q&As, are also sometimes archived by moderators on-site on /chive/.

Shinny archive

Are you sure that it's just getting started? Because my peer-reviewed studies indicate that IT'S OVER.

archive.soyjak.in was a replacement for now defunct Log Warehouse based on Hayden. While it is fast and simple, it unfortunately lacks any sort of search features. It is also recommended that you archive particularly important threads manually.

4chan archives

See also


Citations