{"id":5229,"date":"2022-04-25T00:20:50","date_gmt":"2022-04-24T16:20:50","guid":{"rendered":"https:\/\/ixyzero.com\/blog\/?p=5229"},"modified":"2022-04-25T00:20:50","modified_gmt":"2022-04-24T16:20:50","slug":"%e7%94%a8-docx4j-%e5%af%b9docx%e6%96%87%e6%a1%a3%e8%bf%9b%e8%a1%8c%e7%ae%80%e5%8d%95%e8%af%bb%e5%86%99","status":"publish","type":"post","link":"https:\/\/ixyzero.com\/blog\/archives\/5229.html","title":{"rendered":"\u7528 docx4j \u5bf9docx\u6587\u6863\u8fdb\u884c\u7b80\u5355\u8bfb\u5199"},"content":{"rendered":"\n<p>=Start=<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">\u7f18\u7531\uff1a<\/h4>\n\n\n\n<p>\u7b80\u5355\u8bb0\u5f55\u4e00\u4e0b\u5982\u4f55\u7528 docx4j \u5bf9\u5e38\u89c1\u7684Office\u683c\u5f0f\u6587\u6863\uff08\u6b64\u5904\u4ee5docx\u6587\u4ef6\u4e3a\u4f8b\uff09\u8fdb\u884c\u8bfb\u5199\u64cd\u4f5c\u3002<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">\u6b63\u6587\uff1a<\/h4>\n\n\n\n<h5 class=\"wp-block-heading\">\u53c2\u8003\u89e3\u7b54\uff1a<\/h5>\n\n\n\n<p>\u4ece\u96f6\u521b\u5efadocx\u6587\u4ef6\u7684\u65b9\u6cd5\u5728\u524d\u9762\u7684\u6587\u7ae0\u4e2d\u6709\u8bb0\u5f55\uff0c\u8fd9\u91cc\u4e0d\u91cd\u590d\u5199\u4e86\uff0c\u53ea\u8bb0\u5f55\u4e00\u4e0b\u8bfb\u53d6docx\u6587\u4ef6\u5185\u5bb9\u7684\u65b9\u6cd5\uff0c\u6838\u5fc3\u5176\u5b9e\u5c31\u662f\u904d\u5386\u5b58\u50a8\u4e86\u6587\u6863\u5185\u5bb9\u7684w:t\u8282\u70b9\uff0c\u4f46\u5bf9\u4e8e\u6211\u8fd9\u79cd\u521a\u63a5\u89e6\u7684\u65b0\u624b\u6765\u8bf4\u4e00\u5f00\u59cb\u4e0d\u77e5\u9053\u600e\u4e48\u505a\u624d\u662f\u6700\u5927\u7684\u5165\u95e8\u969c\u788d\u3002<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>package com.example;\n\nimport com.example.rewrite.BinaryPartAbstractImageExt;\nimport org.docx4j.dml.wordprocessingDrawing.Inline;\nimport org.docx4j.jaxb.Context;\nimport org.docx4j.jaxb.XPathBinderAssociationIsPartialException;\nimport org.docx4j.openpackaging.exceptions.Docx4JException;\nimport org.docx4j.openpackaging.packages.WordprocessingMLPackage;\nimport org.docx4j.openpackaging.parts.WordprocessingML.BinaryPartAbstractImage;\nimport org.docx4j.openpackaging.parts.WordprocessingML.MainDocumentPart;\nimport org.docx4j.wml.*;\n\nimport javax.xml.bind.JAXBElement;\nimport javax.xml.bind.JAXBException;\nimport java.io.File;\nimport java.math.BigInteger;\nimport java.net.MalformedURLException;\nimport java.net.URL;\nimport java.util.List;\n\n\/**\n * @author ixyzero\n * Created on 2022-04-23\n *\/\npublic class opOfficeTextMark {\n    private static boolean isDebug = false;\n\n    public static void main(String&#91;] args) {\n        String filePath = \"20220423.docx\";\n        docx4jCodeExample.docxCreate(filePath, \"add some content here.\");\n\n        WordprocessingMLPackage wordMLPackage = null;\n        try {\n            wordMLPackage = WordprocessingMLPackage.load(new File(filePath));\n        } catch (Docx4JException e) {\n            e.printStackTrace();\n        }\n        printWordContent(wordMLPackage);\n\n        String markStr = \"this is hidden text.\";\n        textMark(wordMLPackage, markStr);\n\n        try {\n            wordMLPackage.save(new File(String.format(\"new-%s\",filePath)));\n        } catch (Docx4JException e) {\n            e.printStackTrace();\n        }\n        printWordContent(wordMLPackage);\n        return;\n    }\n\n    \/**\n     * Use XPath expression (\/\/w:t) to get all text nodes from the main document part.\n     * &lt;w:p> \u8868\u793a\u6bb5\u843d\u683c\u5f0f\u3001&lt;w:r> \u8868\u793a\u5b57\u7b26\u683c\u5f0f\u3001&lt;w:t> \u8868\u793a\u6587\u672c\u5185\u5bb9\u3002\n     * \u5176\u4e2d\uff0c&lt;w:t> \u5305\u542b\u5728 &lt;w:r> \u4f53 \u4e2d\uff0c&lt;w:r> \u5305 \u542b \u5728 &lt;w:p> \u4f53 \u4e2d\u3002\n     * \u5373\u4fbf &lt;w:r> \u6709 &lt;w:vanish\/> \u7b49\u5c5e\u6027\uff08\u9690\u85cf\u6587\u672c\uff09\uff0c\u4f46\u662f\u8fd9\u91cc\u76f4\u63a5\u62ff\u7684 &lt;w:t> \u6ca1\u6709\u7ba1\u5c5e\u6027\u95ee\u9898\uff0c\u6240\u4ee5\u53ef\u4ee5\u62ff\u5168\u3002\n     *\/\n    private static void printWordContent(WordprocessingMLPackage wordMLPackage) {\n        MainDocumentPart mainDocumentPart = wordMLPackage.getMainDocumentPart();\n        String textNodesXPath = \"\/\/w:t\";\n        List&lt;Object> textNodes= null;\n        try {\n            textNodes = mainDocumentPart.getJAXBNodesViaXPath(textNodesXPath, true);\n        } catch (JAXBException e) {\n            e.printStackTrace();\n        } catch (XPathBinderAssociationIsPartialException e) {\n            e.printStackTrace();\n        }\n        for (Object obj : textNodes) {\n            Text text = (Text) ((JAXBElement) obj).getValue();\n            String textValue = text.getValue();\n            System.out.println(textValue);\n        }\n\n        return;\n    }\n\n    \/\/ \u7ed9\u4f20\u5165\u7684 docx \u6587\u6863\u5728\u7b2c\u4e00\u6bb5\u548c\u6700\u540e\u4e00\u6bb5\u6dfb\u52a0\u9690\u85cf\u6587\u672c\n    private static void textMark(WordprocessingMLPackage docxPackage, String markStr) {\n        P firstP = getFirstParagraph(docxPackage);\n        P lastP = getLastParagraph(docxPackage);\n\n        Text text = Context.getWmlObjectFactory().createText();\n        text.setValue(markStr);\n\n        R run= Context.getWmlObjectFactory().createR();\n        RPr runProperties = new RPr();\n        runProperties.setVanish(new BooleanDefaultTrue());\n        HpsMeasure sz = new HpsMeasure();\n        sz.setVal(BigInteger.valueOf(1L));\n        runProperties.setSz(sz);\n        run.setRPr(runProperties);\n        run.getContent().add(text);\n\n        firstP.getContent().add(run);\n        if(firstP != lastP) {\n            lastP.getContent().add(run);\n        }\n    }\n\n    private static P getFirstParagraph(WordprocessingMLPackage wordMLPackage) {\n        for (Object obj : wordMLPackage.getMainDocumentPart().getContent()) {\n            if (obj instanceof P) {\n                return ((P) obj);\n            }\n        }\n        \/\/ create if not found\n        return Context.getWmlObjectFactory().createP();\n    }\n\n    private static P getLastParagraph(WordprocessingMLPackage wordMLPackage) {\n        List&lt;Object> contents = wordMLPackage.getMainDocumentPart().getContent();\n        for (int i = contents.size()-1; i>=0; i--) {\n            if (contents.get(i) instanceof P) {\n                return ((P) contents.get(i));\n            }\n        }\n        \/\/ create if not found\n        return Context.getWmlObjectFactory().createP();\n    }\n}\n<\/code><\/pre>\n\n\n\n<p><\/p>\n\n\n\n<h5 class=\"wp-block-heading\">\u53c2\u8003\u94fe\u63a5\uff1a<\/h5>\n\n\n\n<p>Introduction To Docx4J<br><a href=\"https:\/\/www.baeldung.com\/docx4j\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.baeldung.com\/docx4j<\/a><\/p>\n\n\n\n<p>\u5982\u4f55\u4f7f\u7528 docx4j \u4e2d\u7684 getJaxbElement \u65b9\u6cd5<br><a href=\"https:\/\/www.tabnine.com\/code\/java\/methods\/org.docx4j.openpackaging.parts.DocPropsCustomPart\/getJaxbElement\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.tabnine.com\/code\/java\/methods\/org.docx4j.openpackaging.parts.DocPropsCustomPart\/getJaxbElement<\/a><\/p>\n\n\n\n<p>Show all text of a docx in a stringBuilder with docx4j<br><a href=\"https:\/\/stackoverflow.com\/questions\/26117645\/show-all-text-of-a-docx-in-a-stringbuilder-with-docx4j\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/stackoverflow.com\/questions\/26117645\/show-all-text-of-a-docx-in-a-stringbuilder-with-docx4j<\/a><br><a href=\"https:\/\/www.docx4java.org\/forums\/docx-java-f6\/is-it-possible-to-extract-all-text-also-tab-and-hyphen-t1996.html\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.docx4java.org\/forums\/docx-java-f6\/is-it-possible-to-extract-all-text-also-tab-and-hyphen-t1996.html<\/a><\/p>\n\n\n\n<p>I am using docx4j for reading .docx files and I need to get the paragraph of a document and replace strings<br><a href=\"https:\/\/stackoverflow.com\/questions\/13199900\/i-am-using-docx4j-for-reading-docx-files-and-i-need-to-get-the-paragraph-of-a-d\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/stackoverflow.com\/questions\/13199900\/i-am-using-docx4j-for-reading-docx-files-and-i-need-to-get-the-paragraph-of-a-d<\/a><\/p>\n\n\n\n<p>\u5229\u7528docx4j\u5b8c\u7f8e\u5bfc\u51faword\u6587\u6863\uff08\u6807\u7b7e\u66ff\u6362\u3001\u63d2\u5165\u56fe\u7247\u3001\u751f\u6210\u8868\u683c\uff09<br><a href=\"https:\/\/blog.csdn.net\/qq_31905135\/article\/details\/80431042\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/blog.csdn.net\/qq_31905135\/article\/details\/80431042<\/a><\/p>\n\n\n\n<p>java \u64cd\u4f5cword(docx4j)<br><a href=\"https:\/\/www.cnblogs.com\/marydon20170307\/p\/14757039.html\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.cnblogs.com\/marydon20170307\/p\/14757039.html<\/a><br><a href=\"https:\/\/blog.csdn.net\/weixin_34295316\/article\/details\/86022702\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/blog.csdn.net\/weixin_34295316\/article\/details\/86022702<\/a><\/p>\n\n\n\n<p>java docx4j_docx4j\u57fa\u672c\u64cd\u4f5c<br><a href=\"https:\/\/blog.csdn.net\/weixin_28932161\/article\/details\/114308517\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/blog.csdn.net\/weixin_28932161\/article\/details\/114308517<\/a><\/p>\n\n\n\n<p>Java \u4f7f\u7528Docx4j\u5b9e\u73b0word\u6587\u6863Docx\u683c\u5f0f\u8f6cDoc\u683c\u5f0f<br><a href=\"https:\/\/blog.csdn.net\/qq_41394352\/article\/details\/123302153\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/blog.csdn.net\/qq_41394352\/article\/details\/123302153<\/a><\/p>\n\n\n\n<p>\u5229\u7528docx4j word\u8f6cpdf<br><a href=\"https:\/\/www.csdn.net\/tags\/MtTaMgysMzg3ODAtYmxvZwO0O0OO0O0O.html\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.csdn.net\/tags\/MtTaMgysMzg3ODAtYmxvZwO0O0OO0O0O.html<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/github.com\/plutext\/docx4j\/tree\/master\/docs\">https:\/\/github.com\/plutext\/docx4j\/tree\/master\/docs<\/a><\/p>\n\n\n\n<p>=END=<\/p>\n","protected":false},"excerpt":{"rendered":"<p>=Start= \u7f18\u7531\uff1a \u7b80\u5355\u8bb0\u5f55\u4e00\u4e0b\u5982\u4f55\u7528 docx4j \u5bf9\u5e38\u89c1\u7684Office\u683c\u5f0f\u6587\u6863\uff08\u6b64\u5904\u4ee5docx\u6587\u4ef6\u4e3a\u4f8b [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[23,7],"tags":[1803,1802,61],"class_list":["post-5229","post","type-post","status-publish","format-standard","hentry","category-knowledgebase-2","category-programing","tag-docx","tag-docx4j","tag-java"],"views":3835,"_links":{"self":[{"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/posts\/5229","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/comments?post=5229"}],"version-history":[{"count":1,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/posts\/5229\/revisions"}],"predecessor-version":[{"id":5230,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/posts\/5229\/revisions\/5230"}],"wp:attachment":[{"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/media?parent=5229"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/categories?post=5229"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ixyzero.com\/blog\/wp-json\/wp\/v2\/tags?post=5229"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}