AINIRO Influencer Program - Get a Free Web Scraper
We've always had the best website scraper in the AI chatbot space. You can try it out below to verify that.
It works by taking a URL to an HTML document, transforming the HTML document to Markdown, for then to return the Markdown to the caller. It sounds easy until you try to implement it for yourself. Creating high quality Markdown from HTML is actually very difficult.
The advantages
For us being an AI company delivering high quality AI chatbots, the ability to create Markdown from HTML is crucial. It allows us to deliver AI chatbots that can display hyperlinks, display images, and preserve relative importance of specific parts of the original HTML document as we scrape your website. Try asking our chatbot some questions to understand the benefit here.
Our influencer program
If you're an influencer in the AI space, we would love to barter with you. We'll give you access to our web scraper, allowing you to invoke it n amount of times per month for free, in return for you sharing our website, and/or saying nice things about our commercial offer. "n" here depends upon how many followers you have, and how much you promote us on your social media profiles.
This allows you to create for instance custom GPTs that are using our web scraper beneath the hoods. We've previously written about how to use our web scraper in your own custom GPTs, but below are some examples of kick ass custom GPTs you can create with access to a high quality web scraper.
- Translate GPTs that translates a URL into some language while keeping all information as is
- Summarize GPTs that summarizes the most important parts of articles, allowing users to read them much faster
- Marketing GPTs that creates social media updates or ads based upon my existing content
- Learning GPTs that takes original content and explains it to children using easily understood language
- Summarize today's news GPTs
- Snoop Dogg summary GPT that summarizes a URL in the style of Snoop Dogg (example below)
Since our web scraper preserves images and hyperlinks, no relevant information is actually lost, and your custom GPTs can still display images, hyperlinks, lists, etc, as they are. If you're interested in doing such a barter with us, you can contact us below.
Example GPT
Below we've created an example custom GPT that basically takes any URL, and converts it to a "Snoop Dogg style summary".
Below is a screenshot of the Snoop Dogg GPT in action.
How it works
We'll give you a JWT token and a JSON OpenAPI specification. You'll have to create an "action" for your custom GPT that contains the JSON and is using the JWT token we give you as a "Bearer API key Authentication token". Below is a screenshot.
Below you can find the JSON, but notice it won't work without a Bearer token provided by us.
{
"openapi": "3.1.0",
"info": {
"title": "AINIRO.IO Magic Cloud Cloudlet",
"description": "Interact with your cloudlet",
"version": "v17.3.5"
},
"servers": [
{
"url": "https://ainiro.io"
}
],
"paths": {
"/magic/modules/freebies/scrape-url": {
"get": {
"operationId": "get_freebies_scrape-url",
"description": "Scrapes the specified [url], converts it to Markdown, and returns Markdown to caller.",
"parameters": [
{
"name": "url",
"in": "query",
"schema": {
"type": "string"
}
}
]
}
}
}
}
Example web scraper
Below is an example of our web scraper. Give it a URL, and watch it transform your URL to Markdown to understand the value here. If you copy the resulting Markdown and paste it into ChatGPT, for then to tell it to for instance translate into French while preserving everything as is, including Markdown - You'll rapidly understand its value. Below is an example instruction you can provide to ChatGPT.
Translate the following into French while preserving all Markdown, images, and hyperlinks as is: [MARKDOWN_HERE]
Now of course, the point being, that the above web scraper provides better results than anything else out there that we're aware of, which explains our high quality AI chatbots.
Implementation details
The web scraper works by the following rules though.
- If it takes more than 30 seconds to retrieve the HTML document, it will fail. This is to avoid exhausting our backend server with invocations towards slow servers
- It identifies as with a user agent string of
AINIRO-Crawler 2.0
. If you've got scraping shields on your page, and/or not explicitly allowing for our user agent, it will fail graciously and return the status code returned from the scraped URL as it fails - It only considers MIME types of
text/html
, and will graciously fail with status code 415 if it's not given HTML content - It scrapes the specified URL with the an accept header of
text/html
, and accept encoding ofidentity
Conclusion
If you're an AI influencer, and you want to create some kick ass custom GPTs, based upon website scraping - Then feel free to reach out to us, since we might be willing to do some bartering here, and give you access to our web scraper free of charge in return for some social media luvin', marketing, and recommendations to your herd ...
When you contact us, provide us with links to your social media profiles, such that we can verify you're actually an AI influencer.
Have a Custom AI Solution
At AINIRO we specialise in delivering custom AI solutions and AI chatbots. If you want to talk to us about how we can help you implement your next custom AI solution, you can reach out to us below.