Ladder

*Ladder is a web proxy to help bypass paywalls.* This is a self-hosted version of [1ft.io](https://1ft.io) and [12ft.io](https://12ft.io). It is inspired by [13ft](https://github.com/wasi-master/13ft).

### Why

Freedom of information is an essential pillar of democracy and informed decision-making. While media organizations have legitimate financial interests, it is crucial to strike a balance between profitability and the public's right to access information. The proliferation of paywalls raises concerns about the erosion of this fundamental freedom, and it is imperative for society to find innovative ways to preserve access to vital information without compromising the sustainability of journalism. In a world where knowledge should be shared and not commodified, paywalls should be critically examined to ensure that they do not undermine the principles of an open and informed society.

> **Disclaimer:** This project is intended for educational purposes only. The author does not endorse or encourage any unethical or illegal activity. Use this tool at your own risk.

### Features

- [x] Bypass Paywalls
- [x] Remove CORS headers from responses, assets, and images ...
- [x] Apply domain based ruleset/code to modify response
- [x] Keep site browsable
- [x] API
- [x] Fetch RAW HTML
- [x] Custom User Agent
- [x] Custom X-Forwarded-For IP
- [x] [Docker container](https://github.com/everywall/ladder/pkgs/container/ladder) (amd64, arm64)
- [x] Linux binary
- [x] Mac OS binary
- [x] Windows binary (untested)
- [x] Removes most of the ads (unexpected side effect ¯\\\_(ツ)_/¯ )
- [x] Basic Auth
- [x] Disable logs
- [x] No Tracking
- [x] Limit the proxy to a list of domains
- [x] Expose Ruleset to other ladders
- [ ] Optional TOR proxy
- [ ] A key to share only one URL
- [ ] Fetch from Google Cache if not available

### Limitations

Certain sites may display missing images or encounter formatting issues.
This can be attributed to the site's reliance on JavaScript or CSS for image and resource loading, which presents a limitation when accessed through this proxy. If you prefer a full experience, please consider buying a subscription for the site.

Some sites do not expose their content to search engines, which means that the proxy cannot access the content. A future version will try to fetch the content from Google Cache.

## Installation

> **Warning:** If your instance will be publicly accessible, make sure to enable Basic Auth. This will prevent unauthorized users from using your proxy. If you do not enable Basic Auth, anyone can use your proxy to browse nasty/illegal stuff. And you will be responsible for it.

### Binary

1) Download binary [here](https://github.com/everywall/ladder/releases/latest)
2) Unpack and run the binary `./ladder`
3) Open Browser (Default: http://localhost:8080)

### Docker

```bash
docker run -p 8080:8080 -d --name ladder ghcr.io/everywall/ladder:latest
```

### Docker Compose

```bash
curl https://raw.githubusercontent.com/everywall/ladder/main/docker-compose.yaml --output docker-compose.yaml
docker-compose up -d
```

### Helm

See [README.md](/helm-chart/README.md) in helm-chart sub-directory for more information.
## Usage

### Browser

1) Open Browser (Default: http://localhost:8080)
2) Enter URL
3) Press Enter

Or directly by appending the URL to the end of the proxy URL:
http://localhost:8080/https://www.example.com

Or create a bookmark with the following URL:

```javascript
javascript:window.location.href="http://localhost:8080/"+location.href
```

### API

```bash
curl -X GET "http://localhost:8080/api/https://www.example.com"
```

### RAW

http://localhost:8080/raw/https://www.example.com

### Running Ruleset

http://localhost:8080/ruleset

## Configuration

### Environment Variables

| Variable | Description | Value |
| --- | --- | --- |
| `PORT` | Port to listen on | `8080` |
| `PREFORK` | Spawn multiple server instances | `false` |
| `USER_AGENT` | User agent to emulate | `Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)` |
| `X_FORWARDED_FOR` | IP forwarder address | `66.249.66.1` |
| `USERPASS` | Enables Basic Auth, format `admin:123456` | `` |
| `LOG_URLS` | Log fetched URLs | `true` |
| `DISABLE_FORM` | Disables URL Form Frontpage | `false` |
| `FORM_PATH` | Path to custom Form HTML | `` |
| `RULESET` | URL to a ruleset file | `https://raw.githubusercontent.com/everywall/ladder/main/ruleset.yaml` or `/path/to/my/rules.yaml` |
| `EXPOSE_RULESET` | Make your Ruleset available to other ladders | `true` |
| `ALLOWED_DOMAINS` | Comma separated list of allowed domains. Empty = no limitations | `` |
| `ALLOWED_DOMAINS_RULESET` | Allow Domains from Ruleset. false = no limitations | `false` |

`ALLOWED_DOMAINS` and `ALLOWED_DOMAINS_RULESET` are joined together. If both are empty, no limitations are applied.

### Ruleset

It is possible to apply custom rules to modify the response. This can be used to remove or modify unwanted elements on the page. The ruleset is a YAML file that contains a list of rules for each domain and is loaded on startup.

See [ruleset.yaml](ruleset.yaml) for an example.
```yaml
- domain: example.com          # Includes all subdomains
  domains:                     # Additional domains to apply the rule
    - www.example.de
    - www.beispiel.de
  headers:
    x-forwarded-for: none      # override X-Forwarded-For header or delete with none
    referer: none              # override Referer header or delete with none
    user-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/119.0.0.0 Safari/537.36
    cookie: privacy=1
  regexRules:
    - match: ]*\s+)?src="(/)([^"]*)"
      replace:
- domain: www.anotherdomain.com # Domain where the rule applies
  paths:                        # Paths where the rule applies
    - /article
  googleCache: false            # Use Google Cache to fetch the content
  regexRules:                   # Regex rules to apply
    - match: ]*\s+)?src="(/)([^"]*)"
      replace: