Generate a robots.txt for Astro
```sh
npm install astro-robots-txt
```
This Astro integration generates a _robots.txt_ for your Astro project during build.
- Why astro-robots-txt
- Installation
- Usage
- Configuration
- Examples
- Contributing
- Changelog
- Inspirations
---
The _robots.txt_ file informs search engines which pages on your website should be crawled. See Google's own advice on robots.txt to learn more.
For an Astro project, you usually create the _robots.txt_ file in a text editor and place it in the public/ directory.
In that case, you must manually keep the `site` option in _astro.config.\*_ in sync with the `Sitemap:` record in _robots.txt_.
This breaks the DRY principle.
Sometimes, especially during development, it's necessary to prevent your site from being indexed. To achieve this, you need to place the `<meta name="robots" content="noindex" />` tag into the `<head>` section of your pages or add `X-Robots-Tag: noindex` to the HTTP response header, and then add the lines `User-agent: *` and `Disallow: /` to _robots.txt_.
Again, you have to do it manually in two different places.
astro-robots-txt can help in both cases on the _robots.txt_ side. See details in this demo repo.
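For example, here is a rough sketch (not taken from the upstream docs) of keeping the "don't index preview builds" rule in one place, using the `policy` option documented below. The `DEPLOY_ENV` environment variable is a made-up convention for illustration; only `site`, `integrations`, and `policy` come from this integration.

```js
// astro.config.mjs - a sketch; the DEPLOY_ENV check is hypothetical.
import { defineConfig } from 'astro/config';
import robotsTxt from 'astro-robots-txt';

// Assume an environment variable distinguishes production from preview builds.
const isProduction = process.env.DEPLOY_ENV === 'production';

export default defineConfig({
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      // Allow everything in production, block all crawlers on previews.
      policy: [
        isProduction
          ? { userAgent: '*', allow: '/' }
          : { userAgent: '*', disallow: '/' },
      ],
    }),
  ],
});
```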
---
Quick Install
The experimental astro add command-line tool automates the installation for you. Run the following command in a new terminal window. Then, follow the prompts and type "y" in the terminal (meaning "yes") for each one.

```sh
# Using NPM
npx astro add astro-robots-txt
```

Then, restart the dev server by typing `CTRL-C` and then `npm run astro dev` in the terminal window that was running Astro.
Because this command is new, it might not properly set things up. If that happens, log an issue on Astro GitHub and try the manual installation steps below.
Manual Install
First, install the astro-robots-txt package using your package manager. If you're using npm or aren't sure, run this in the terminal:

```sh
npm install --save-dev astro-robots-txt
```

Then, apply this integration to your _astro.config.\*_ file using the `integrations` property:

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  // ...
  integrations: [robotsTxt()],
};
```
Then, restart the dev server.
Usage
The astro-robots-txt integration requires a deployment / site URL for generation. Add your site's URL to your _astro.config.\*_ using the `site` property. Then, apply this integration to your _astro.config.\*_ file using the `integrations` property.

__astro.config.mjs__

```js
import { defineConfig } from 'astro/config';
import robotsTxt from 'astro-robots-txt';

export default defineConfig({
  site: 'https://example.com',
  integrations: [robotsTxt()],
});
```
Note that unlike other configuration options, `site` is set in the root `defineConfig` object, rather than inside the `robotsTxt()` call.

Now, build your site for production via the `astro build` command. You should find your _robots.txt_ under dist/robots.txt!

> **Warning**
> If you forget to add a `site`, you'll get a friendly warning when you build, and the _robots.txt_ file won't be generated.
Example of a generated _robots.txt_ file:

__robots.txt__

```text
User-agent: *
Allow: /
Sitemap: https://example.com/sitemap-index.xml
```

Configuration
To configure this integration, pass an object to the `robotsTxt()` function call in astro.config.mjs.

__astro.config.mjs__

```js
// ...
export default defineConfig({
  integrations: [
    robotsTxt({
      transform: ...
    }),
  ],
});
```
sitemap
| Type | Required | Default value |
| :---------------------------: | :------: | :-----------: |
| `Boolean / String / String[]` | No | `true` |

If you omit the `sitemap` parameter or set it to `true`, the resulting output in _robots.txt_ will be `Sitemap: your-site-url/sitemap-index.xml`.

If you want to get the _robots.txt_ file without the `Sitemap: ...` entry, set the `sitemap` parameter to `false`.

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      sitemap: false,
    }),
  ],
};
```

When `sitemap` is a `String` or `String[]`, its values must be valid URLs. Only the http and https protocols are allowed.

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      sitemap: [
        'https://example.com/first-sitemap.xml',
        'http://another.com/second-sitemap.xml',
      ],
    }),
  ],
};
```
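A single `String` value works the same way. A small sketch, assuming a hypothetical custom-sitemap.xml URL:

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      // One sitemap URL instead of an array (the file name here is hypothetical).
      sitemap: 'https://example.com/custom-sitemap.xml',
    }),
  ],
};
```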
sitemapBaseFileName
| Type | Required | Default value |
| :------: | :------: | :-------------: |
| `String` | No | `sitemap-index` |

Sitemap file name before the file extension (`.xml`). It will be used if the `sitemap` parameter is `true` or omitted.

:grey_exclamation: The @astrojs/sitemap and astro-sitemap integrations have sitemap-index.xml as their primary output. That is why the default value of `sitemapBaseFileName` is set to `sitemap-index`.

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      sitemapBaseFileName: 'custom-sitemap',
    }),
  ],
};
```
host
| Type | Required | Default value |
| :----------------: | :------: | :-----------: |
| `Boolean / String` | No | `undefined` |

Some crawlers (Yandex) support a `Host` directive, allowing websites with multiple mirrors to specify their preferred domain.

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      host: 'your-domain-name.com',
    }),
  ],
};
```

If the `host` option is set to `true`, the `Host` output will be automatically resolved using the `site` option from the Astro config.
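For completeness, a short sketch with `host` set to `true` rather than a string; the `Host` value is then derived from the `site` option:

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      // The Host record is resolved from the `site` option instead of a literal string.
      host: true,
    }),
  ],
};
```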
transform
| Type | Required | Default value |
| :------------------------------------------------------------------: | :------: | :-----------: |
| `(content: String): String` or `(content: String): Promise<String>` | No | `undefined` |

Sync or async function called just before writing the text output to disk.

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      transform(content) {
        return `# Some comments before the main content.\n# Second line.\n\n${content}`;
      },
    }),
  ],
};
```
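Since the callback may also return a Promise, an async variant looks like this (the banner text is arbitrary):

__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      // Async transform: any function returning Promise<String> is accepted.
      async transform(content) {
        const banner = await Promise.resolve('# Generated by astro-robots-txt\n\n');
        return banner + content;
      },
    }),
  ],
};
```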
policy
| Type | Required | Default value |
| :--------: | :------: | :--------------------------------: |
| `Policy[]` | No | `[{ allow: '/', userAgent: '*' }]` |

List of `Policy` rules.

Type `Policy`:

| Name | Type | Required | Description |
| :----------: | :-----------------: | :------: | :--------------------------------------------------------------------------------------------------------------------------------- |
| `userAgent` | `String` | Yes | You must provide a name of the automatic client (search engine crawler). Wildcards are allowed. |
| `disallow` | `String / String[]` | No | Disallowed paths for crawling |
| `allow` | `String / String[]` | No | Allowed paths for crawling |
| `crawlDelay` | `Number` | No | Minimum interval (in seconds) for the crawler to wait after loading one page, before starting the next one |
| `cleanParam` | `String / String[]` | No | Indicates that the page's URL contains parameters that should be ignored during crawling. Maximum string length is limited to 500. |
__astro.config.mjs__

```js
import robotsTxt from 'astro-robots-txt';

export default {
  site: 'https://example.com',
  integrations: [
    robotsTxt({
      policy: [
        {
          userAgent: 'Googlebot',
          allow: '/',
          disallow: ['/search'],
          crawlDelay: 2,
        },
        {
          userAgent: 'OtherBot',
          allow: ['/allow-for-all-bots', '/allow-only-for-other-bot'],
          disallow: ['/admin', '/login'],
          crawlDelay: 2,
        },
        {
          userAgent: '*',
          allow: '/',
          disallow: '/search',
          crawlDelay: 10,
          cleanParam: 'ref /articles/',
        },
      ],
    }),
  ],
};
```

Examples

| Example  | Source | Playground  |
| :------- | :----- | :---------- |
| basic    | GitHub | Play Online |
| advanced | GitHub | Play Online |
Contributing

You're welcome to submit an issue or PR!
Changelog

See CHANGELOG.md for a history of changes to this integration.
Inspirations

- gatsby-plugin-robots-txt
- generate-robotstxt
- is-valid-hostname