下面的示例将解析HTML为新的文档(Document),然后获取html body的Element对象。
Document document=Jsoup.parseBodyFragment(html); Element body=document.body();
parseBodyFragment(String html)方法将输入的HTML解析为新的文档,该文档对象可用于遍历并获取html正文片段的详细信息。
Document.body 示例
使用您选择的任何编辑器在C:/> jsoup中创建以下Java程序。
JsoupTester.java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;
public class JsoupTester {
public static void main(String[] args) {
</span><span class="typ">String</span><span class="pln"> html </span><span class="pun">=</span><span class="pln"> </span><span class="str">"<div><p>Sample Content</p>"</span><span class="pun">;</span><span class="pln">
</span><span class="typ">Document</span><span class="pln"> document </span><span class="pun">=</span><span class="pln"> </span><span class="typ">Jsoup</span><span class="pun">.</span><span class="pln">parseBodyFragment</span><span class="pun">(</span><span class="pln">html</span><span class="pun">);</span><span class="pln">
</span><span class="typ">Element</span><span class="pln"> body </span><span class="pun">=</span><span class="pln"> document</span><span class="pun">.</span><span class="pln">body</span><span class="pun">();</span><span class="pln">
</span><span class="typ">Elements</span><span class="pln"> paragraphs </span><span class="pun">=</span><span class="pln"> body</span><span class="pun">.</span><span class="pln">getElementsByTag</span><span class="pun">(</span><span class="str">"p"</span><span class="pun">);</span><span class="pln">
</span><span class="kwd">for</span><span class="pln"> </span><span class="pun">(</span><span class="typ">Element</span><span class="pln"> paragraph </span><span class="pun">:</span><span class="pln"> paragraphs</span><span class="pun">)</span><span class="pln"> </span><span class="pun">{</span><span class="pln">
</span><span class="typ">System</span><span class="pun">.</span><span class="kwd">out</span><span class="pun">.</span><span class="pln">println</span><span class="pun">(</span><span class="pln">paragraph</span><span class="pun">.</span><span class="pln">text</span><span class="pun">());</span><span class="pln">
</span><span class="pun">}</span><span class="pln">
}
}
使用 javac 编译器编译类,如下所示:
C:\jsoup>javac JsoupTester.java
现在运行JsoupTester以查看输出。
C:\jsoup>java JsoupTester
查看输出。
Sample Content