Comment on page
Google Dorking
In this tutorial, we’ll create a Context Menu action on the AngelList company profile page that searches for PDF files within the company's URL. (For instance, you can search for ebooks or quarterly earnings documents from a company’s name.) To do this, we'll trigger a Google search that uses Google's advanced search operators. In the Open Source Intelligence (OSINT) community, this is called Google Dorking.
Here are three examples of Google Dork searches:
inurl:resume "peter parker"
(finds sites with "resume" in the URL and "peter parker" in the text)related:tesla.com
(finds sites related to tesla.com)site:hubspot.com filetype:PDF
(find PDFs hosted on the HubSpot website)
We'll use the HubSpot AngelList profile page in this tutorial to develop our workflow.
The first time you open the Page Editor on a new webpage, you need to grant PixieBrix access to the page. You can grant permanent access by either:
- Clicking Grant Permanent Access, or
- Granting temporary access by clicking on the PixieBrix extension in the Chrome Extensions dropdown and then refreshing the page
If you had the Page Editor open before navigating to the AngelList page, you'll need to close the Page Editor by clicking the X in the top right corner and then re-open it to provide access.
Click Add in the top left of the Page Editor and choose Context Menu
When you right-click on the page, you should see a PixieBrix context menu item
In the Configuration panel, change the Title from “Context menu item” to something more descriptive, such as “Google Dork”
Now your context menu will look like this:
We only want this context menu to appear on AngelList company pages.
Therefore, we should change the Sites configuration from
https://angel.co/*
to https://angel.co/company/*
In the Advanced configuration, change the Target Mode to Document
We'll need to add the Extract from Page brick. To add a new brick, click the + button in the Brick Actions
Search "extract from page" and choose the brick.
Click the pointer icon below Value and hover your mouse over hubspot.com in the ABOUT HUBSPOT section until the shaded blue area looks like the image below. Then click to select this page element:
Click the “Preview” tab on the right side of the Page Editor, and you’ll see the following Raw Data:
In the Selectors section of the Input configuration, change
property
to companyUrl
The Raw Data section will now look like this:
Note that the selector value should be
#main dl div > ul
. If not, you can manually type this in or choose it from the dropdown.In the Preview tab, click the page icon next to
companyUrl
to copy its property path, @data.companyUrl
, to your clipboard.- Click the + button and search for this brick, and select it.
The Google Dork for searching PDFs on a website is:
- 1.site:<domain>
- 2.filetype:PDF
We can express this Goole search as:
site:{{@data.companyUrl}} filetype:PDF
Press the Google Dork context menu item to test your search. It should open Google in a new tab and execute the following search:
Try it from different AngelList company pages to see how the search dynamically changes. Once you're happy with the button, click Save to use this search in the future.
You can modify this search by changing the Google Dork expression. Check out these sites for inspiration:
Last modified 30d ago