Utility to automatically link the URLs, email addresses, phone numbers, hashtags, and mentions (Twitter, Instagram) in a given block of text/HTML
npm install autolinker!version




Automatic linking of URLs, emails, phone numbers, mentions, and hashtags in text.
``
Input: "Visit google.com"
|
|
v
Output: "Visit google.com"
`
Because I had so much trouble finding a good auto-linking implementation out in
the wild, I decided to roll my own. It seemed that everything I found was either
an implementation that didn't cover every case, had many false positives linked,
or was just limited in one way or another.
So, this utility attempts to handle everything. It:
- Autolinks URLs, whether or not they start with the protocol (i.e. 'http://').
In other words, it will automatically link the text "google.com", as well as
"http://google.com". Will also autolink IPv4 addresses.
- Will properly handle URLs with query params, anchors,
and special characters, and not include chars like a trailing . at the end)
of a sentence, or a char if the URL is inside parenenthesis.google.com/#anchor
- Will autolink email addresses
- Will autolink phone numbers
- Will autolink mentions (Twitter, Instagram, Soundcloud, TikTok, Youtube)
- Will autolink hashtags (Twitter, Instagram, Facebook, TikTok, Youtube)
- Won't clobber URLs with hash anchors by treating them as hashtags (like some other libraries do). For example: is properly linked.href
- Will properly handle HTML input. The utility will not overwrite an
attribute inside anchor () tags (or any other tag/attribute), and will not
accidentally wrap the inner text of /
`
Using the static link()
method:
`javascript`
const linkedText = Autolinker.link(textToAutolink[, options]);
Using as a class:
`javascript
const autolinker = new Autolinker([ options ]);
const linkedText = autolinker.link(textToAutoLink);
`
Note: if using the same options to autolink multiple pieces of html/text, it's
slightly more efficient to create a single Autolinker instance, and run the
link()
method repeatedly (i.e. use the "class" form above).
#### Examples:
`javascript
const linkedText = Autolinker.link("Check out google.com");
// Produces: "Check out google.com"
const linkedText = Autolinker.link("Check out google.com", {
newWindow: false
});
// Produces: "Check out google.com"
`
The following are the options which may be specified for linking. These are
specified by providing an Object as the second parameter to Autolinker.link().
These include:
- newWindow : boolean
true to have the links should open in a new window when clicked, falsetrue
otherwise. Defaults to .true
- urls : boolean/Object
to have URLs auto-linked, false to skip auto-linking of URLs. Defaults true
to .
This option also accepts an Object form with 3 properties to allow for
more customization of what exactly gets linked. All default to true:
- schemeMatches (boolean): true to match URLs found prefixed with a scheme,http://google.com
i.e. , or other+scheme://google.com, false totrue
prevent these types of matches.
- tldMatches: to match URLs with known top level domains (.com, .net,google.com
etc.) that are not prefixed with a scheme (i.e. 'http://'). Ex: ,asdf.org/?page=1
, etc. Set to false to prevent these types of matches.true
- ipV4Matches (boolean): to match IPv4 addresses. Ex: 192.168.0.1.false
to prevent these types of matches. Note that if the IP address had schemeMatches
a prefixed scheme (such as 'http://'), and is true, it
will still be linked.
Example usage: urls: { schemeMatches: true, tldMatches: false, ipV4Matches: true }
- email : boolean
true to have email addresses auto-linked, false to skip auto-linking oftrue
email addresses. Defaults to .true
- phone : boolean
to have phone numbers auto-linked, false to skip auto-linking oftrue
phone numbers. Defaults to .
- mention : string
A string for the service name to have mentions (@username) auto-linked to. Supported values at this time are 'twitter', 'soundcloud', 'instagram', 'tiktok', and 'youtube'. Pass false to skip auto-linking of mentions. Defaults to false.
- hashtag : boolean/string
A string for the service name to have hashtags auto-linked to. Supported values at this time are 'twitter', 'facebook', 'instagram', 'tiktok', and 'youtube'. Pass false to skip auto-linking of hashtags. Defaults to false.
- stripPrefix : boolean
true to have the 'http://' (or 'https://') and/or the 'www.' false
stripped from the beginning of displayed links, otherwise. true
Defaults to .true
This option also accepts an Object form with 2 properties to allow for
more customization of what exactly is prevented from being displayed.
Both default to :
- scheme (boolean): true to prevent the scheme part of a URL match'http://google.com'
from being displayed to the user. Example: 'google.com'
will be displayed as . false to not strip the 'http://'
scheme. NOTE: Only an or 'https://' scheme will be'file://'
removed, so as not to remove a potentially dangerous scheme (such
as or 'javascript:').true
- www (boolean): to prevent the 'www.' part of a URL match'www.google.com'
from being displayed to the user. Ex: will be'google.com'
displayed as . false to not strip the 'www'.
- stripTrailingSlash : boolean
true to remove the trailing slash from URL matches, false to keeptrue
the trailing slash. Example when : http://google.com/ will be http://google.com
displayed as . Defaults to true.
- truncate : number/Object
A number for how many characters long URLs/emails/Twitter handles/Twitter
hashtags should be truncated to inside the text of a link. If the match is
over the number of characters, it will be truncated to this length by
replacing the end of the string with a two period ellipsis ('..').
Example: a url like 'http://www.yahoo.com/some/long/path/to/a/file' truncated
to 25 characters may look like this: 'yahoo.com/some/long/pat..'
In the object form, both length and location may be specified to performlocation
truncation. Available options for are: 'end' (default), 'middle',
or 'smart'. Example usage:
`javascript`
truncate: { length: 32, location: 'middle' }
The 'smart' truncation option is for URLs where the algorithm attempts to
strip out unnecessary parts of the URL (such as the 'www.', then URL scheme,
hash, etc.) before trying to find a good point to insert the ellipsis if it is
still too long. For details, see source code of:
TruncateSmart
- className : string
A CSS class name to add to the generated anchor tags. This class will be added
to all links, as well as this class plus "url"/"email"/"phone"/"hashtag"/"mention"
suffixes for styling url/email/phone/hashtag/mention links differently.
The name of the hashtag/mention service is also added as a CSS class for those
types of matches.
For example, if this config is provided as "my-link", then:
- URL links will have the CSS classes: "my-link my-link-url"
- Email links will have the CSS classes: "my-link my-link-email"
- Phone links will have the CSS classes: "my-link my-link-phone"
- Twitter mention links will have the CSS classes: "my-link my-link-mention my-link-twitter"
- Instagram mention links will have the CSS classes: "my-link my-link-mention my-link-instagram"
- Hashtag links will have the CSS classes: "my-link my-link-hashtag my-link-twitter"
- decodePercentEncoding: boolean
true to decode percent-encoded characters in URL matches, false to keeptrue
the percent-encoded characters.
Example when : https://en.wikipedia.org/wiki/San_Jos%C3%A9 willhttps://en.wikipedia.org/wiki/San_José
be displayed as .true
Defaults to .
- replaceFn : Function
A function to use to programmatically make replacements of matches in the
input string, one at a time. See the section
Custom Replacement Function for
more details.
- sanitizeHtml : boolean
true to HTML-encode the start and end brackets of existing HTML tags found <
in the input string. This will escape and > characters to < and >
, respectively.true
Setting this to will prevent XSS (Cross-site Scripting) attacks, false
but will remove the significance of existing HTML tags in the input string. If
you would like to maintain the significance of existing HTML tags while also
making the output HTML string safe, leave this option as and use a
tool like https://github.com/cure53/DOMPurify (or others) on the input string
before running Autolinker.
Defaults to false.
For example, if you wanted to disable links from opening in new windows, you could do:
`javascript`
const linkedText = Autolinker.link("Check out google.com", {
newWindow: false
});
// Produces: "Check out google.com"
And if you wanted to truncate the length of URLs (while also not opening in a new window), you could do:
`javascript`
const linkedText = Autolinker.link("http://www.yahoo.com/some/long/path/to/a/file", {
truncate: 25,
newWindow: false
});
// Produces: "yahoo.com/some/long/pat.."
One could update an entire DOM element that has unlinked text to auto-link them
as such:
`javascript`
const myTextEl = document.getElementById('text');
myTextEl.innerHTML = Autolinker.link(myTextEl.innerHTML);
Using the same pre-configured Autolinker
instance in multiple locations of a codebase (usually by dependency injection):
`javascript
const autolinker = new Autolinker({ newWindow: false, truncate: 25 });
//...
autolinker.link("Check out http://www.yahoo.com/some/long/path/to/a/file");
// Produces: "Check out yahoo.com/some/long/pat.."
//...
autolinker.link( "Go to www.google.com" );
// Produces: "Go to google.com"
`
If you're just interested in retrieving the list of Matches without producing a transformed string, you can use the parse() method.
For example:
`
const matches = Autolinker.parse("Hello google.com, I am asdf@asdf.com", {
urls: true,
email: true
});
console.log(matches.length); // 2
console.log(matches[0].type); // 'url'
console.log(matches[0].getUrl()); // 'google.com'
console.log(matches[1].type); // 'email'
console.log(matches[1].getEmail()); // 'asdf@asdf.com'
`
A custom replacement function (replaceFn)
may be provided to replace url/email/phone/mention/hashtag matches on an
individual basis, based on the return from this function.
#### Full example, for purposes of documenting the API:
`javascript
const input = "..."; // string with URLs, Email Addresses, Mentions (Twitter, Instagram), and Hashtags
const linkedText = Autolinker.link(input, {
replaceFn : function(match) {
console.log("href = ", match.getAnchorHref());
console.log("text = ", match.getAnchorText());
switch(match.type) {
case 'url':
console.log("url: ", match.getUrl());
return true; // let Autolinker perform its normal anchor tag replacement
case 'email':
const email = match.getEmail();
console.log("email: ", email);
if(email === "my@own.address") {
return false; // don't auto-link this particular email address; leave as-is
} else {
return; // no return value will have Autolinker perform its normal anchor tag replacement (same as returning true)
}
case 'phone':
console.log("Phone Number: ", match.getPhoneNumber());
return '' + match.getPhoneNumber() + '';
case 'mention':
console.log("Mention: ", match.getMention());
console.log("Mention Service Name: ", match.getServiceName());
return '' + match.getMention() + '';
case 'hashtag':
console.log("Hashtag: ", match.getHashtag());
return '' + match.getHashtag() + '';
}
}
} );
`
#### Modifying the default generated anchor tag
`javascript
const input = "..."; // string with URLs, Email Addresses, Mentions (Twitter, Instagram), and Hashtags
const linkedText = Autolinker.link( input, {
replaceFn : function( match ) {
console.log("href = ", match.getAnchorHref());
console.log("text = ", match.getAnchorText());
const tag = match.buildTag(); // returns an Autolinker.HtmlTag instance for an tag
tag.setAttr('rel', 'nofollow'); // adds a 'rel' attribute
tag.addClass('external-link'); // adds a CSS class
tag.setInnerHtml('Click here!'); // sets the inner html for the anchor tag
return tag;
}
} );
`
The replaceFn is provided one argument:
1. An Autolinker.match.Match
object which details the match that is to be replaced.
A replacement of the match is made based on the return value of the function.
The following return values may be provided:
1. No return value (undefined), or true (boolean): Delegate back tofalse
Autolinker to replace the match as it normally would.
2. (boolean): Do not replace the current match at all - leave as-is.
3. Any string: If a string is returned from the function, the string will be used
directly as the replacement HTML for the match.
4. An Autolinker.HtmlTag
instance, which can be used to build/modify an HTML tag before writing out its
HTML text.
The full API docs for Autolinker may be referenced at:
http://gregjacobs.github.io/Autolinker.js/api/
http://gregjacobs.github.io/Autolinker.js/examples/
1. Internet Explorer support has been removed since its official demise in June
2022.
1. The urls.wwwMatches config has been removed. A www. prefix is now treatedMatch.getType()
like any other subdomain of a top level domain (TLD) match (such as
'subdomain.google.com').
1. should be replaced with Match.type. This allows for Match
TypeScript type narrowing of objects returned by the parse() replaceFn
method or inside the .Matcher
1. The classes have been removed in favor of a single finite stateAutolinker.AnchorTagBuilder
machine parser, greatly improving the performance of Autolinker (3x
performance improvement over the 3.x branch), but removing some of the
customizability of the old regular expressions. Will address this
customizability in a future release.
1. , Autolinker.HtmlTag, and Autolinker.match.*
references have been removed. These shouldn't be needed as public APIs, but
please raise a GitHub issue if these are for some reason needed.
1. If you are still on v1.x, first follow the instructions in the
Upgrading from v1.x -> v2.x
section below.
2. The HtmlParser class has been removed in favor of an internal parseHtml()HtmlParser
function which replaces the old regexp-based implementation with a state
machine parser that is guaranteed to run in linear time. If you were using
the class directly, I recommend switching to htmlparser2, which implements the HTML semantics parseHtml()
better. The internal function that Autolinker now uses is
fairly geared towards Autolinker's purposes, and may not be useful in a
general HTML parsing sense.
1. If you are still on v0.x, first follow the instructions in the
Upgrading from v0.x -> v1.x
section below.
2. The codebase has been converted to TypeScript, and uses ES6 exports. You can
now use the import statement to pull in the Autolinker class and related Match
entities such as :
`ts`
// ES6/TypeScript/Webpack
import Autolinker, { Match } from 'autolinker';
The require() interface is still supported as well for Node.js:
`ts`
// Node.js
const Autolinker = require('autolinker');
3. You will no longer need the @types/autolinker package as this package nowMatcher
exports its own types
4. You will no longer be able to override the regular expressions in the
classes by assigning to the prototype (for instance, something likePhoneMatcher.prototype.regex = ...
). This is due to how TypeScript creates Matcher
properties for class instances in the constructor rather than on prototypes.
The idea of providing your own regular expression for these classes is a
brittle notion anyway, as the classes rely on capturing groups in Matcher
the RegExp being in the right place, or even multiple capturing groups for
the same piece of information to support a different format. These capturing
groups and associated code are subject to change as the regular expression
needs to be updated, and will not involve a major version release of
Autolinker.
In the future you will be able to override the default classes
entirely to provide your own implementation, but please raise an issue (or
+1 an issue) if you think the library should support a currently-unsupported
format.
1. twitter option removed, replaced with mention (which accepts 'twitter', twitter
'instagram' and 'soundcloud' values)
2. Matching mentions (previously the option) now defaults toreplaceFn
being turned off. Previously, Twitter handle matching was on by
default.
3. option now called with just one argument: the Match autolinker
object (previously was called with two arguments: and match
)replaceFn
4. (Used inside the ) TwitterMatch replaced with MentionMatch
, and MentionMatch.getType() now returns 'mention' 'twitter'
instead of replaceFn
5. (Used inside the ) TwitterMatch.getTwitterHandle() -> MentionMatch.getMention()
Pull requests definitely welcome. To setup the project, make sure you have
Node.js installed. Then open up a command prompt and type
the following:
`sh
npm install -g pnpm@latest # this project uses pnpm workspaces, and pnpm is a faster npm anyway :)
cd Autolinker.js # where you cloned the project
pnpm install
`
To run the tests:
`sh`
pnpm run test
- Make sure to add tests to check your new functionality/bugfix
- Run the pnpm run test command to test
#### Running the Live Example Page Locally
Run:
`sh`
pnpm run devserver
Then open your browser to: http://localhost:8080/docs/examples/index.html
You should be able to make a change to source files, and refresh the page to see
the changes.
Run:
`sh`
pnpm run benchmarks
> Note: See the Benchmarks Table above for current results.
Couple points on the benchmarks:
* These benchmarks attempt to set up all libraries by configuring comparable features to Autolinker (e.g.: linking emails, hashtags, mentions, etc.) to try to get an apples-to-apples comparison.
* While developing, recommend running the benchmarks a few times both before and after making any changes if developing.
* See the benchmarks folder and benchmarks/input-text.ts for how the benchmarks are set up and what input they are given.
#### Documentation Generator Notes
This project uses JSDuck for its
documentation generation, which produces the page at http://gregjacobs.github.io/Autolinker.js.
Unfortunately, JSDuck is a very old project that is no longer maintained. As
such, it doesn't support TypeScript or anything from ES6 (the class keyword,
arrow functions, etc). However, I have yet to find a better documentation
generator that creates such a useful API site. (Suggestions for a new one are
welcome though - please raise an issue.)
Since ES6 is not supported, we must generate the documentation from the ES5
output. As such, a few precautions must be taken care of to make sure the
documentation comes out right:
1. @cfg documentation tags must exist above a class property that has a
default value, or else it won't end up in the ES5 output. For example:
`ts
// Will correctly end up in the ES5 output
/**
* @cfg {String} title
*/
readonly title: string = '';
// Will not end up in ES5 output, and thus, won't end up in the generated
// documentation
/**
* @cfg {String} title
*/
readonly title: string;
`@constructor
2. The tag must be replaced with @method constructor`
To build the documentation, you will need Ruby
installed (note: Ruby comes pre-installed on MacOS), with the
JSDuck gem.
See https://github.com/senchalabs/jsduck#getting-it for installation
instructions on Windows/Mac/Linux.
See Releases
[1]: https://github.com/alexcorvi/anchorme.js "anchorme.js"
[2]: https://linkify.js.org/docs/linkify-html.html "linkify-html"
[3]: https://github.com/markdown-it/linkify-it "linkify-it"