Skip to content

MetaScraper is a extracts metadata information of a website.

License

Notifications You must be signed in to change notification settings

mehmetcansahin/metascraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MetaScraper

Rust crates.io Released API docs MIT licensed

MetaScraper is a extracts metadata information of a website.

MetaScraper uses tl as its html parser. This choice was made because tl was the fastest in benchmark tests. For more information, visit the parse_query_bench GitHub page.

Usage

let input = include_str!("test.html");
let metascraper = MetaScraper::parse(input).unwrap();
println!("{}", metascraper.title().unwrap());
// Title
let metadata = metascraper.metadata();
println!("{:?}", metadata);
// MetaData {
//     title: Some("Title"),
//     description: Some("Description"),
//     canonical: Some("https://mehmetcan.sahin.dev"),
//     language: Some("en"),
//     rss: Some("rss.xml"),
//     metatags: Some([
//         Metatag { name: "X-UA-Compatible", content: "IE=edge" },
//         Metatag { name: "viewport", content: "width=device-width, initial-scale=1.0" },
//         Metatag { name: "description", content: "Description" }
//         ])
// }

License

MetaScraper is licensed under the MIT License.

About

MetaScraper is a extracts metadata information of a website.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published