Run External Link Checker#
Command line#
Install the package with:
pip install siglatools
Run the following command with the correct configurations.
run_external_link_checker -msi <master_spreadsheet_id> -gacp /path/to/google-api-credentials.json
GitHub Actions (for collaborators+ only)#
From the list of workflows, select
Manual Run External Link Checker.Click on the
Run workflowdropdown button.Enter a list of
spreadsheetIds, delimited by,. For examplespreadsheetId-1,spreadsheetId-2,spreadsheetId-3.Click on the green
Run workflowbutton to run the external link checker.Wait until a new workflow run result shows up and click on it see its details.
On workflow details page, once the run is finished click on the
external-link-artifactto download a csv file of bad external URLs.
External Links Reasons#
The tab delimited csv file has a reason fieldname as to why the URL is included for further inspection. Here are the relevant reasons:
404 - Not foundmeans the webpage doesn’t exist.403 - Forbiddenmeans the webpage required authentication or the webpage doesn’t allow a script to visit the webpage. Either way, this webpage requires further inspection.Untrusted SSL Certificatemeans the webpage doesn’t have an acceptable SSL Certificate and the browser may warn users of potential security risks when they visit the webpage.Request timed outmeans it took too long to get a response from the webpage and requires further inspection.Error connectingmeans the script was unable to visit the webpage and requires further inspection.Unknown errormeans the script encountered an unknown error and requires further inspection.