下面的示例将HTML解析为Document对象后使用方法获取文本元素信息。
Document document=Jsoup.parse(html); Element link=document.select("a").first(); System.out.println("Text: " + link.text());
元素对象代表dom元素,并提供各种方法来获取dom元素的文本。
Element.text 示例
使用您选择的任何编辑器在C:/> jsoup中创建以下Java程序。
JsoupTester.java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
public class JsoupTester {
public static void main(String[] args) {
</span><span class="typ">String</span><span class="pln"> html </span><span class="pun">=</span><span class="pln"> </span><span class="str">"<html><head><title>Sample Title</title></head>"</span><span class="pln">
</span><span class="pun">+</span><span class="pln"> </span><span class="str">"<body>"</span><span class="pln">
</span><span class="pun">+</span><span class="pln"> </span><span class="str">"<p>Sample Content</p>"</span><span class="pln">
</span><span class="pun">+</span><span class="pln"> </span><span class="str">"<div id=sampleDiv><a href=www.google.com>Google</a>"</span><span class="pln">
</span><span class="pun">+</span><span class="pln"> </span><span class="str">"<h3><a>Sample</a><h3>"</span><span class="pln">
</span><span class="pun">+</span><span class="str">"</div>"</span><span class="pln">
</span><span class="pun">+</span><span class="str">"</body></html>"</span><span class="pun">;</span><span class="pln">
</span><span class="typ">Document</span><span class="pln"> document </span><span class="pun">=</span><span class="pln"> </span><span class="typ">Jsoup</span><span class="pun">.</span><span class="pln">parse</span><span class="pun">(</span><span class="pln">html</span><span class="pun">);</span><span class="pln">
</span><span class="com">//a with href</span><span class="pln">
</span><span class="typ">Element</span><span class="pln"> link </span><span class="pun">=</span><span class="pln"> document</span><span class="pun">.</span><span class="kwd">select</span><span class="pun">(</span><span class="str">"a"</span><span class="pun">).</span><span class="pln">first</span><span class="pun">();</span><span class="pln">
</span><span class="typ">System</span><span class="pun">.</span><span class="kwd">out</span><span class="pun">.</span><span class="pln">println</span><span class="pun">(</span><span class="str">"Text: "</span><span class="pln"> </span><span class="pun">+</span><span class="pln"> link</span><span class="pun">.</span><span class="pln">text</span><span class="pun">());</span><span class="pln">
}
}
使用 javac 编译器编译类,如下所示:
C:\jsoup>javac JsoupTester.java
现在运行JsoupTester以查看输出。
C:\jsoup>java JsoupTester
查看输出。
Text: Google