OpenHTMLToPDF:将自定义字体嵌入到 HTML 创建的 PDF 中
OpenHTMLToPDF: Embed a custom font into PDF created out of HTML
我使用 Jsoup 和 OpenHTMLToPDF. I have to use a different font in my PDF to have non-latin glyphcs covered (see here 从 HTML 创建了一个 PDF。如何正确嵌入我的字体?
重现问题的简化程序:
src/main/resources/test.html
<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8" />
<title>Font Test</title>
<style>
@font-face {
font-family: 'source-sans';
font-style: normal;
font-weight: 400;
src: url(fonts/SourceSansPro-Regular.ttf);
}
</style>
</head>
<body>
<p style="font-family: 'source-sans',serif">Latin Script</p>
<p style="font-family: 'source-sans',serif">Είμαι ελληνικό κείμενο.</p>
</body>
</html>
- 此文件应写成 PDF
- 在浏览器中它看起来正确并使用 Source Sans 字体。
src/main/java/main.java:
import com.openhtmltopdf.extend.FSSupplier;
import com.openhtmltopdf.pdfboxout.PdfRendererBuilder;
import org.jsoup.Jsoup;
import org.jsoup.helper.W3CDom;
import org.w3c.dom.Document;
import java.io.*;
import java.nio.charset.StandardCharsets;
import java.util.Objects;
public class main {
public static void main(String[] args) {
System.out.println("Starting");
try {
final W3CDom w3cDom = new W3CDom();
final Document w3cDoc = w3cDom.fromJsoup(Jsoup.parse(readFile()));
final OutputStream outStream = new FileOutputStream("test.pdf");
final PdfRendererBuilder pdfBuilder = new PdfRendererBuilder();
pdfBuilder.useFastMode();
pdfBuilder.withW3cDocument(w3cDoc, "/");
pdfBuilder.useFont(new File(main.class.getClassLoader().getResource("fonts/SourceSansPro-Regular.ttf").getFile()), "source-sans");
pdfBuilder.toStream(outStream);
pdfBuilder.run();
outStream.close();
} catch (Exception e) {
System.out.println("PDF could not be created: " + e.getMessage());
}
System.out.println("Finish.");
}
private static String readFile() throws IOException {
final ClassLoader classLoader = main.class.getClassLoader();
final InputStream inputStream = classLoader.getResourceAsStream("test.html");
final StringBuilder sb = new StringBuilder();
final Reader r = new InputStreamReader(Objects.requireNonNull(inputStream), StandardCharsets.UTF_8);
char[] buf = new char[1024];
int amt = r.read(buf);
while(amt > 0) {
sb.append(buf, 0, amt);
amt = r.read(buf);
}
return sb.toString();
}
}
- 不用理会第二个函数,它只是读取HTML文件并且只包含在这里,以获得完整的程序。
src/main/resources/fonts/SourceSansPro-regular.ttf
pom.xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>paf</groupId>
<artifactId>test</artifactId>
<version>1.0-SNAPSHOT</version>
<build>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-compiler-plugin</artifactId>
<configuration>
<source>7</source>
<target>7</target>
</configuration>
</plugin>
</plugins>
</build>
<dependencies>
<dependency>
<groupId>com.openhtmltopdf</groupId>
<artifactId>openhtmltopdf-pdfbox</artifactId>
<version>0.0.1-RC18</version>
</dependency>
<dependency>
<groupId>org.jsoup</groupId>
<artifactId>jsoup</artifactId>
<version>1.11.2</version>
</dependency>
</dependencies>
</project>
程序输出:
Starting
com.openhtmltopdf.load INFO:: TIME: parse stylesheets 148ms
com.openhtmltopdf.match INFO:: media = print
com.openhtmltopdf.match INFO:: Matcher created with 147 selectors
com.openhtmltopdf.load INFO:: Loading font(source-sans) from InputStream supplier now.
com.openhtmltopdf.exception WARNING:: bad URL given: /fonts/SourceSansPro-Regular.ttf
com.openhtmltopdf.exception WARNING:: Could not load @font-face font: /fonts/SourceSansPro-Regular.ttf
com.openhtmltopdf.exception WARNING:: Font metrics not available. Probably a bug.
com.openhtmltopdf.exception WARNING:: Font metrics not available. Probably a bug.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
Finish.
生成 PDF
Latin Script
##### ######## #######.
- 衬线字体。
编辑 1:根据评论中链接的页面进行各种更改并更新到 RC18。现在有新的输出,但 PDF 中的字体仍然不正确。
编辑 2:尝试了快速渲染器
好的。感谢@Tilman Hausherr 的评论,我在 GitHub-Issue Tracker of openhtmltopdf and got some help.
中提问
如果有人登陆这里感兴趣,这些更改使其有效:
src/main/java/main.java(仅更改部分,请参阅上面的其余部分):
public static void main(String[] args) {
System.out.println("Starting");
try {
final W3CDom w3cDom = new W3CDom();
final Document w3cDoc = w3cDom.fromJsoup(Jsoup.parse(readFile()));
final OutputStream outStream = new FileOutputStream("test.pdf");
final PdfRendererBuilder pdfBuilder = new PdfRendererBuilder();
pdfBuilder.useFastMode();
pdfBuilder.withW3cDocument(w3cDoc, "/");
pdfBuilder.useFont(new File(main.class.getClassLoader().getResource("fonts/SourceSansPro-Regular.ttf").getFile()), "source-sans");
pdfBuilder.toStream(outStream);
pdfBuilder.run();
outStream.close();
} catch (Exception e) {
System.out.println("PDF could not be created: " + e.getMessage());
}
System.out.println("Finish.");
}
src/main/resources/fonts/SourceSansPro-regular.ttf
来自 src/main/resources/test.html(仅更改部分,请参阅上面的其余部分)
@font-face {
font-family: 'source-sans';
font-style: normal;
font-weight: 400;
src: url(fonts/SourceSansPro-Regular.ttf);
-fs-font-subset: complete-font;
}
以上所有解决方案都试过了,但对我没有用。
然而,使用标记代替 @font-face 对我有用。
这里是html的例子:
<html
xmlns:th="http://www.thymeleaf.org">
<head>
<meta content="text/html; charset=UTF-8" http-equiv="content-type"/>
<link rel="preconnect" href="https://fonts.googleapis.com">
<link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
<link href="https://fonts.googleapis.com/css2?family=Noto+Serif+Ethiopic&display=swap" rel="stylesheet">
</head>
<body class="c9" style="font-family: 'Noto Serif Ethiopic', serif;">
<p> {my content here} </p>
所以使用 link 并使用特定于我需要的语言的 google 字体为我解决了这个问题。
我希望这对其他人有帮助。
我使用 Jsoup 和 OpenHTMLToPDF. I have to use a different font in my PDF to have non-latin glyphcs covered (see here 从 HTML 创建了一个 PDF。如何正确嵌入我的字体?
重现问题的简化程序:
src/main/resources/test.html
<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8" />
<title>Font Test</title>
<style>
@font-face {
font-family: 'source-sans';
font-style: normal;
font-weight: 400;
src: url(fonts/SourceSansPro-Regular.ttf);
}
</style>
</head>
<body>
<p style="font-family: 'source-sans',serif">Latin Script</p>
<p style="font-family: 'source-sans',serif">Είμαι ελληνικό κείμενο.</p>
</body>
</html>
- 此文件应写成 PDF
- 在浏览器中它看起来正确并使用 Source Sans 字体。
src/main/java/main.java:
import com.openhtmltopdf.extend.FSSupplier;
import com.openhtmltopdf.pdfboxout.PdfRendererBuilder;
import org.jsoup.Jsoup;
import org.jsoup.helper.W3CDom;
import org.w3c.dom.Document;
import java.io.*;
import java.nio.charset.StandardCharsets;
import java.util.Objects;
public class main {
public static void main(String[] args) {
System.out.println("Starting");
try {
final W3CDom w3cDom = new W3CDom();
final Document w3cDoc = w3cDom.fromJsoup(Jsoup.parse(readFile()));
final OutputStream outStream = new FileOutputStream("test.pdf");
final PdfRendererBuilder pdfBuilder = new PdfRendererBuilder();
pdfBuilder.useFastMode();
pdfBuilder.withW3cDocument(w3cDoc, "/");
pdfBuilder.useFont(new File(main.class.getClassLoader().getResource("fonts/SourceSansPro-Regular.ttf").getFile()), "source-sans");
pdfBuilder.toStream(outStream);
pdfBuilder.run();
outStream.close();
} catch (Exception e) {
System.out.println("PDF could not be created: " + e.getMessage());
}
System.out.println("Finish.");
}
private static String readFile() throws IOException {
final ClassLoader classLoader = main.class.getClassLoader();
final InputStream inputStream = classLoader.getResourceAsStream("test.html");
final StringBuilder sb = new StringBuilder();
final Reader r = new InputStreamReader(Objects.requireNonNull(inputStream), StandardCharsets.UTF_8);
char[] buf = new char[1024];
int amt = r.read(buf);
while(amt > 0) {
sb.append(buf, 0, amt);
amt = r.read(buf);
}
return sb.toString();
}
}
- 不用理会第二个函数,它只是读取HTML文件并且只包含在这里,以获得完整的程序。
src/main/resources/fonts/SourceSansPro-regular.ttf
pom.xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>paf</groupId>
<artifactId>test</artifactId>
<version>1.0-SNAPSHOT</version>
<build>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-compiler-plugin</artifactId>
<configuration>
<source>7</source>
<target>7</target>
</configuration>
</plugin>
</plugins>
</build>
<dependencies>
<dependency>
<groupId>com.openhtmltopdf</groupId>
<artifactId>openhtmltopdf-pdfbox</artifactId>
<version>0.0.1-RC18</version>
</dependency>
<dependency>
<groupId>org.jsoup</groupId>
<artifactId>jsoup</artifactId>
<version>1.11.2</version>
</dependency>
</dependencies>
</project>
程序输出:
Starting
com.openhtmltopdf.load INFO:: TIME: parse stylesheets 148ms
com.openhtmltopdf.match INFO:: media = print
com.openhtmltopdf.match INFO:: Matcher created with 147 selectors
com.openhtmltopdf.load INFO:: Loading font(source-sans) from InputStream supplier now.
com.openhtmltopdf.exception WARNING:: bad URL given: /fonts/SourceSansPro-Regular.ttf
com.openhtmltopdf.exception WARNING:: Could not load @font-face font: /fonts/SourceSansPro-Regular.ttf
com.openhtmltopdf.exception WARNING:: Font metrics not available. Probably a bug.
com.openhtmltopdf.exception WARNING:: Font metrics not available. Probably a bug.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
com.openhtmltopdf.render WARNING:: Font is null.
Finish.
生成 PDF
Latin Script
##### ######## #######.
- 衬线字体。
编辑 1:根据评论中链接的页面进行各种更改并更新到 RC18。现在有新的输出,但 PDF 中的字体仍然不正确。
编辑 2:尝试了快速渲染器
好的。感谢@Tilman Hausherr 的评论,我在 GitHub-Issue Tracker of openhtmltopdf and got some help.
中提问如果有人登陆这里感兴趣,这些更改使其有效:
src/main/java/main.java(仅更改部分,请参阅上面的其余部分):
public static void main(String[] args) {
System.out.println("Starting");
try {
final W3CDom w3cDom = new W3CDom();
final Document w3cDoc = w3cDom.fromJsoup(Jsoup.parse(readFile()));
final OutputStream outStream = new FileOutputStream("test.pdf");
final PdfRendererBuilder pdfBuilder = new PdfRendererBuilder();
pdfBuilder.useFastMode();
pdfBuilder.withW3cDocument(w3cDoc, "/");
pdfBuilder.useFont(new File(main.class.getClassLoader().getResource("fonts/SourceSansPro-Regular.ttf").getFile()), "source-sans");
pdfBuilder.toStream(outStream);
pdfBuilder.run();
outStream.close();
} catch (Exception e) {
System.out.println("PDF could not be created: " + e.getMessage());
}
System.out.println("Finish.");
}
src/main/resources/fonts/SourceSansPro-regular.ttf
来自 src/main/resources/test.html(仅更改部分,请参阅上面的其余部分)
@font-face {
font-family: 'source-sans';
font-style: normal;
font-weight: 400;
src: url(fonts/SourceSansPro-Regular.ttf);
-fs-font-subset: complete-font;
}
以上所有解决方案都试过了,但对我没有用。
然而,使用标记代替 @font-face 对我有用。
这里是html的例子:
<html
xmlns:th="http://www.thymeleaf.org">
<head>
<meta content="text/html; charset=UTF-8" http-equiv="content-type"/>
<link rel="preconnect" href="https://fonts.googleapis.com">
<link rel="preconnect" href="https://fonts.gstatic.com" crossorigin>
<link href="https://fonts.googleapis.com/css2?family=Noto+Serif+Ethiopic&display=swap" rel="stylesheet">
</head>
<body class="c9" style="font-family: 'Noto Serif Ethiopic', serif;">
<p> {my content here} </p>
所以使用 link 并使用特定于我需要的语言的 google 字体为我解决了这个问题。
我希望这对其他人有帮助。