程式扎記: [ Java 代碼範本 ] jsoup - Extract attributes, text, and HTML from elements

2015年10月25日星期日

[ Java 代碼範本 ] jsoup - Extract attributes, text, and HTML from elements

Source From Here
Problem
After parsing a document, and finding some elements, you'll want to get at the data inside those elements.

Solution

* To get the value of an attribute, use the Node.attr(String key) method
* For the text on an element (and its combined children), use Element.text()
* For HTML, use Element.html(), or Node.outerHtml() as appropriate

For example:

view plaincopy to clipboardprint?
String html = "An example link.

";  
Document doc = Jsoup.parse(html);  
Element link = doc.select("a").first();  
  
String text = doc.body().text(); // "An example link"  
String linkHref = link.attr("href"); // "http://example.com/"  
String linkText = link.text(); // "example""  
  
String linkOuterH = link.outerHtml();   
    // "example"  
String linkInnerH = link.html(); // "example"  

Description
The methods above are the core of the element data access methods. There are additional others:

* Element.id()
* Element.tagName()
* Element.className() and Element.hasClass(String className)

All of these accessor methods have corresponding setter methods to change the data.

See also
* The reference documentation for Element and the collection Elements class
* Working with URLs
* finding elements with the CSS selector syntax

程式扎記

標籤

2015年10月25日星期日

[ Java 代碼範本 ] jsoup - Extract attributes, text, and HTML from elements

沒有留言:

張貼留言

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

標籤

2015年10月25日 星期日

[ Java 代碼範本 ] jsoup - Extract attributes, text, and HTML from elements

沒有留言:

張貼留言

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

2015年10月25日星期日