The Media AI backstory: I'll try and keep it short.

I worked in PR for a bunch of tech brands, loved every minute of it, especially the Media Relations work.

You might have heard of OnePlus, this was the last and arguably most fun one.

The bread and butter of this work is emailing journalists and creators back and forth with upcoming product launch information as well as RSVP's to events.

The key issue: maintaining a contacts list that is large, robust and updated. Most PR folks do this with a mixture of spreadsheets stored locally and a media database if they have $$$ - usually from the 'big two' i.e Meltwater or Cision.

I used those two platforms, I even ended up being a consultant at the latter. They're both ok. Could be better. But the main issue? They're bloody expensive.

Thus, when i started my agency, I know I had to build my own affordable media database for my own use as well as perhaps other PR's and Social Media managers.

The Challenge

Building a PR media database at scale isn’t always glamorous—especially when you start with 45,000+ rows of messy data in Excel. But it is possible to go from a humble spreadsheet to a robust, custom-built platform using a blend of no-code and low-code tools.

Here’s my story of how I did it, the pitfalls I encountered, and how I finally landed on a solution that actually performs.

1. The Data Dilemma: 30,000 Journalists & 5,000 Contacts

Where it began:

I had a massive Excel spreadsheet containing 30,000 journalist records and 5,000 additional media contacts. This data came from years of collecting business cards, email signatures, and publicly available websites.

The goal: Transform these static lists into a searchable, dynamic PR media database that would help me (and eventually others) find the right journalists by beat, publication, or location—and do it all without incurring typical enterprise software costs.

The first question: Where the heck do I store this data?

I needed something more sophisticated than Excel or Google Sheets.

I wanted an interface that was intuitive for non-engineers, so I started exploring no-code tools.

2. Experimenting with Airtable: Great Start, But Not for 35k Rows

Airtable seemed perfect on paper: it’s a spreadsheet-database hybrid with a friendly user interface and plenty of automation integrations.

Importing Data

Exported my Excel sheets as CSV.

Imported ~30,000 journalist records and ~5,000 other PR contacts into Airtable.

The initial setup was surprisingly simple: I created custom fields for Name, Email, Publication, Social Media Links, etc.

Immediate Hiccups

Performance Issues: Once my base started filling up with tens of thousands of rows, load times lagged significantly. Sorting, filtering, and searching became slow and clunky.

Limitations on Views: Airtable’s grouping and filtering are powerful, but with so many records, the UI was often not as responsive as I needed.

Cost Scaling: For large bases and advanced features, the price climbed quickly.

In short, Airtable is fantastic for smaller, more manageable datasets—but it struggled under the weight of nearly 40k records.

3. Trying Bubble for the Front-End

I still liked the idea of a no-code approach, so I decided to break the problem into two parts:

Front-End (UI): Bubble, a no-code platform known for building web apps quickly.

Backend (Database): Eventually discovered Supabase, but more on that soon.

Why Bubble?

Bubble lets you drag and drop elements, create workflows, and manage your site’s logic without heavy coding.

I hoped that delegating data handling to Bubble’s internal database might improve performance.

What Happened?

Bubble’s editor is powerful, but for very large datasets, it also can bog down.

Once again, I found myself hitting performance bottlenecks when searching or filtering tens of thousands of rows.

It became clear that I needed a dedicated, scalable backend solution.

4. Adopting Supabase for Backend Scalability

Enter Supabase, an open-source Firebase alternative that uses PostgreSQL under the hood. It offers:

- Full-Featured Relational DB: Perfect for large, structured datasets like a media database.
- Scalability: PostgreSQL can handle hundreds of thousands if not millions of rows with minimal slowdown
- APIs & Auth: Built-in authentication and an auto-generated RESTful API let me integrate easily with the front-end of my choice.

Steps to Set Up Supabase:

- Created a new Supabase project.
- Defined a schema mirroring the fields I had in Excel (e.g., name, title, publication, email, social_links, etc.).
- Used Supabase’s dashboard and SQL import features to load the CSV data.
- Verified that my 35,000+ rows imported successfully and quickly!
- Result: The difference was night and day. Queries, sorts, and filters were way faster once the data was in a robust relational database.

5. Integrating Bubble (Front-End) with Supabase

With Supabase in place, I turned back to Bubble for the front-end. My vision: a user-friendly interface for searching, sorting, and tagging journalists. User authentication via Bubble or Supabase’s auth. Minimal code but maximum customization.

Bubble & Supabase Integration Flow: Set up API Calls: In Bubble, I used the API Connector plugin to talk to Supabase’s REST API.

Secure Access: I generated an API key in Supabase and restricted read/write permissions for each table.

Bubble Workflows:

On “Search,” Bubble sends a query to Supabase. Supabase filters results based on the user’s input (e.g., “Tech journalists in California”). Supabase returns data, and Bubble displays it in a responsive table.

Challenges Overcome:

Authentication: Decided whether to handle sign-ups and log-ins via Bubble’s own system or through Supabase’s. Ultimately, I integrated them so that user data syncs back to Supabase for a single source of truth.

Data Privacy:

Ensured the API calls only returned data relevant to the authenticated user’s access level.

6. Scraping Journalist/Creator data with Apify

At this point, I had a working database with journalists’ names, emails, and primary publications. But I really wanted to include additional social media details (like LinkedIn, Twitter, and Instagram handles) and check if they were up to date.

Why Apify?

Apify specializes in web scraping and automation; it has ready-made scrapers (called “actors”) for many popular websites.

I could feed it a list of URLs or queries, and it would return structured data perfect for cross-referencing journalist info.

Process:

- Created an Apify actor to scrape each journalist’s social media link (if known).
- Extracted the bio, follower counts, or any other relevant data.
- Scheduled Apify to run periodically (daily or weekly) to keep data fresh.

Data Format:

Apify returned JSON with fields like social_link, follower_count, description, etc.

Perfect for piping directly into a database.

7. Routing Data with Make.com

Now, I had multiple moving parts:

- Supabase for the main database.
- Bubble for the front-end user interface.
- Apify for scraping social media data.
- I needed an integration layer to orchestrate data flows between these services.

Enter Make.com: A no-code workflow automation tool that connects different apps and web services.

Think of it like Zapier but often more flexible for complex scenarios.

Key Flows: New or Updated Data in Apify → Make.com → Supabase

Whenever Apify scraped new social media info, Make.com grabbed that JSON and updated the corresponding journalist record in Supabase.

Data Validation

Make.com also performed basic data checks, e.g., “Does the email look valid?” or “Is the Twitter handle spelled properly?”

Notifications

I set up email or Slack notifications for major changes, like if Apify found 500 newly updated social handles in a day.

8. The Final Result: A Fast, Scalable PR Media Database

After juggling Excel, Airtable, Bubble, Supabase, Apify, and Make.com, I arrived at a system that:

Scales: 35k+ journalist records load and filter efficiently.

Automates: Apify scrapes new data, Make.com routes it to Supabase in near real-time.

Provides a Clean UI: Bubble delivers a front-end that non-technical users can navigate easily.

Is Cost-Effective: No more paying for seats on enterprise software or dealing with slow interfaces limited by row caps.

Performance Gains: Queries that used to hang for 5–10 seconds in Airtable now execute in under a second in Supabase.

Searching for “Tech journalists in NYC” or “Finance reporters at Forbes” is near-instant.

9. Lessons Learned

Know Your Limits: Tools like Airtable are great up to a certain scale. Beyond that, you need a dedicated database solution.

Decouple Your Front-End and Back-End: Using Bubble for the UI and Supabase for the database meant each part of the system could shine where it performs best.

Automate Early: By integrating Apify and Make.com early on, I avoided manual data entry or scraping tasks that would’ve consumed countless hours.

Plan for Growth: Even if you start with 5k rows, design your database so it can handle 50k—or 500k—because you’ll probably get there faster than you think.

10. What’s Next?

Lists: Use AI to help users quickly assemble the best media lists for sharing with their team.

Inboxes: Connecting popular email clients so users can send email directly within the app.

Chat: Huge! Technically this will be difficult but I want to get to a place where users can directly chat with a AI bot to quickly, assemble and contact at media scale.

Enhancing Search & Filters: I plan to implement full-text search or advanced filters (e.g., “Only show journalists active on Twitter with over 10k followers”).

Analytics Layer: Add a dashboard to see trending journalists or quickly identify which publications are most popular in my database.

Continuous Data Enrichment: Keep discovering new sources to scrape or cross-reference so the data remains fresh and accurate.

In Conclusion

Building a custom PR media database doesn’t have to be an endless headache—if you choose the right tools for the job. I learned this firsthand while wrestling with Excel, wrestling with Airtable, and eventually finding a happy combination of Bubble, Supabase, Apify, and Make.com.

Now, I have a scalable solution that serves my needs (and my users’) without grinding to a halt or blowing up my budget.

If you’re thinking of creating your own large-scale database in the PR or media space, I hope my journey helps you bypass some of the trial-and-error.

Once you get the right tech stack in place, you’ll be amazed at how quickly you can transform raw spreadsheets into powerful, dynamic applications.

PS - Shout out to Prashant and especially Yogesh, couldn't do this without you.

Reply

or to participate

Keep Reading

No posts found