Google Dorking

🍎 Before you begin this tutorial, make sure you've completed the Developer Quick Start Guide.

Overview

In this tutorial, we’ll create a Context Menu action on the AngelList company profile page that searches for PDF files within the company's URL. (For instance, you can search for ebooks or quarterly earnings documents from a company’s name.) To do this, we'll trigger a Google search that uses Google's advanced search operators. In the Open Source Intelligence (OSINT) community, this is called Google Dorking.

Here are three examples of Google Dork searches:

  • inurl:resume "peter parker" (finds sites with "resume" in the URL and "peter parker" in the text)

  • related:tesla.com (finds sites related to tesla.com)

  • site:hubspot.com filetype:PDF (find PDFs hosted on the HubSpot website)

1. Add a Context Menu

Open the Page Editor

We'll use the HubSpot AngelList profile page in this tutorial to develop our workflow.

Navigate your browser to the HubSpot profile page, and open the PixieBrix Page Editor.

Grant Access

The first time you open the Page Editor on a new webpage, you need to grant PixieBrix access to the page. You can grant permanent access by either:

  • Clicking Grant Permanent Access, or

  • Granting temporary access by clicking on the PixieBrix extension in the Chrome Extensions dropdown and then refreshing the page

If you had the Page Editor open before navigating to the AngelList page, you'll need to close the Page Editor by clicking the X in the top right corner and then re-open it to provide access.

Add a Context Menu

Click Add in the top left of the Page Editor and choose Context Menu

When you right-click on the page, you should see a PixieBrix context menu item

Customize the Title

In the Configuration panel, change the Title from “Context menu item” to something more descriptive, such as “Google Dork”

Now your context menu will look like this:

Configure the Sites

We only want this context menu to appear on AngelList company pages.

Therefore, we should change the Sites configuration from https://angel.co/* to https://angel.co/company/*

Configure the Target Mode

In the Advanced configuration, change the Target Mode to Document

2. Select Data

Select the Page Element

We'll need to add the Extract from Page brick. To add a new brick, click the + button in the Brick Actions

Search "extract from page" and choose the brick.

Click the pointer icon below Value and hover your mouse over hubspot.com in the ABOUT HUBSPOT section until the shaded blue area looks like the image below. Then click to select this page element:

Label the Property

Click the “Preview” tab on the right side of the Page Editor, and you’ll see the following Raw Data:

In the Selectors section of the Input configuration, change property to companyUrl

The Raw Data section will now look like this:

Note that the selector value should be #main dl div > ul. If not, you can manually type this in or choose it from the dropdown.

In the Preview tab, click the page icon next to companyUrl to copy its property path, @data.companyUrl, to your clipboard.

3. Construct a Search

Create Your Search

We'll need to add the Google Search in new tab brick.

  • Click the + button and search for this brick, and select it.

The Google Dork for searching PDFs on a website is:

  1. site:<domain>

  2. filetype:PDF

We can express this Goole search as:

site:{{@data.companyUrl}} filetype:PDF

4. Test Your Search

Press the Google Dork context menu item to test your search. It should open Google in a new tab and execute the following search:

Try it from different AngelList company pages to see how the search dynamically changes. Once you're happy with the button, click Save to use this search in the future.

Continued Learning

You can modify this search by changing the Google Dork expression. Check out these sites for inspiration:

🙋Need some help with building? Head over to the Slack Community, and we’ll gladly help you!

Last updated