jsoup
https://github.com/jhy/jsoup
Java
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Java not yet supported39 Subscribers
View all SubscribersAdd a CodeTriage badge to jsoup
Help out
- Issues
- Support Pretty Printer subclassing
- Cleaner & Safelist API revamp
- Should wholeText() introduce newlines between block elements?
- Timeout is effectively half of what it should be
- Element factory method
- Whitelist.addProtocols() cannot only allow base64 image instead of all data uri
- How to encode illegal tag in html body
- Create a setter method to sets Connection read timeout
- Feature Request - Create a setter method to sets the encoding when parsing the response as a Document
- Add method Elements.before(Node node)
- Docs
- Java not yet supported