Clustering Pentaho C.E. 5.x with Jackrabbit repository

I am trying to run multiple Pentaho BI CE server instances on top of a clustered PostgreSQL database.

The Pentaho clustering guide (Cluster the Application Server) says that I should keep the Jackrabbit configuration identical on every node:

Your application nodes all need the same configurations and BA deployments installed already in order for clustering to work.

After that, I only need to configure Jackrabbit's journal and give each node a unique ID:

<Cluster id="Unique_ID ">
    <Journal class="org.apache.jackrabbit.core.journal.DatabaseJournal">
      <param name="revision" value="${rep.home}/revision.log"/>
      <param name="url" value="jdbc:postgresql://HOSTNAME:PORT/jackrabbit"/>
      <param name="driver" value="org.postgresql.Driver"/>
      <param name="user" value="jcr_user"/>
      <param name="password" value="password"/>
      <param name="databaseType" value="postgresql"/>
      <param name="janitorEnabled" value="true"/>
      <param name="janitorSleep" value="86400"/>
      <param name="janitorFirstRunHourOfDay" value="3"/>
    </Journal>
</Cluster>

Jackrabbit's guide on clustering, however, lists more requirements (emphasis mine):

In order to use clustering, the following prerequisites must be met:

  • Each cluster node must have its own repository configuration.
  • A DataStore must always be shared between nodes, if used.
  • The global FileSystem on the repository level must be shared (only the one that is on the same level as the data store; only in the repository.xml file).
  • Each cluster node needs its own (private) workspace level and version FileSystem (only those within the workspace and versioning configuration; the ones in the repository.xml and workspace.xml file).
  • Each cluster node needs its own (private) Search indexes.
  • Every cluster node must be assigned a unique ID.
  • A journal type must be chosen, either based on files or stored in a database.
  • Each cluster node must use the same (shared) journal.
  • The persistence managers must store their data in the same, globally accessible location.

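Does this mean that the FileSystem inside 'Versioning' and 'Workspace' should have a different prefix on each node, or point to another (possibly non-shared) place? That contradicts the Pentaho documentation (Use PostgreSQL as Your Repository Database), where everything points to a single database.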

Does it mean that FileSystem inside 'Versioning' and 'Workspace' should have different prefixes per node or point to another(possibly non shared) place?

If you use a shared database, then yes.
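As a rough sketch of what that could look like on one node, assuming the database-backed `DbFileSystem` from the Pentaho PostgreSQL setup: only the workspace-level and versioning FileSystem get a node-specific `schemaObjectPrefix`. The prefixes `fs_ws_node1_` and `fs_ver_node1_` are hypothetical names chosen for illustration, and HOSTNAME, PORT, and the credentials are the same placeholders as in the question. The persistence managers are left out here; per the Jackrabbit list they keep pointing at the same shared location on every node.

```xml
<!-- Fragment of repository.xml on node 1 (other child elements omitted) -->
<Workspace name="${wsp.name}">
  <!-- Private workspace FileSystem: the prefix is unique to this node (hypothetical name) -->
  <FileSystem class="org.apache.jackrabbit.core.fs.db.DbFileSystem">
    <param name="driver" value="org.postgresql.Driver"/>
    <param name="url" value="jdbc:postgresql://HOSTNAME:PORT/jackrabbit"/>
    <param name="user" value="jcr_user"/>
    <param name="password" value="password"/>
    <param name="schema" value="postgresql"/>
    <param name="schemaObjectPrefix" value="fs_ws_node1_"/>
  </FileSystem>
</Workspace>

<Versioning rootPath="${rep.home}/version">
  <!-- Private versioning FileSystem: again a node-specific prefix (hypothetical name) -->
  <FileSystem class="org.apache.jackrabbit.core.fs.db.DbFileSystem">
    <param name="driver" value="org.postgresql.Driver"/>
    <param name="url" value="jdbc:postgresql://HOSTNAME:PORT/jackrabbit"/>
    <param name="user" value="jcr_user"/>
    <param name="password" value="password"/>
    <param name="schema" value="postgresql"/>
    <param name="schemaObjectPrefix" value="fs_ver_node1_"/>
  </FileSystem>
</Versioning>
```

A second node would use the same fragment with its own prefixes (for example `fs_ws_node2_`), so the nodes share one database without sharing these tables.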

This contradicts with the Pentaho documentation(Use PostgreSQL as Your Repository Database) where everything points to single database.

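Not necessarily; it depends on how you define "same". If you use a local file system or a local database on each node, then the configuration is the same on every node, as in "Your application nodes all need the same configurations and BA deployments installed already in order for clustering to work."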
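For the local file system case, a minimal sketch could look like the fragment below. It assumes Jackrabbit's stock `LocalFileSystem` and Lucene `SearchIndex` classes and the standard `${wsp.home}` and `${rep.home}` variables; the paths shown follow Jackrabbit's default configuration and may differ from what your Pentaho installation ships. The text of the file is identical on every node, yet each node ends up with private storage because the variables resolve to that node's own disk.

```xml
<!-- Fragment of repository.xml, identical on every node (other child elements omitted) -->
<Workspace name="${wsp.name}">
  <!-- Private per node: ${wsp.home} resolves to a directory on the node's local disk -->
  <FileSystem class="org.apache.jackrabbit.core.fs.local.LocalFileSystem">
    <param name="path" value="${wsp.home}"/>
  </FileSystem>
  <!-- The search index is private per node for the same reason -->
  <SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
    <param name="path" value="${wsp.home}/index"/>
  </SearchIndex>
</Workspace>

<Versioning rootPath="${rep.home}/version">
  <!-- Private versioning FileSystem stored under the node's local repository home -->
  <FileSystem class="org.apache.jackrabbit.core.fs.local.LocalFileSystem">
    <param name="path" value="${rep.home}/version"/>
  </FileSystem>
</Versioning>
```

With this layout only the `<Cluster id="...">` value has to differ between nodes, while the journal and the persistence managers still point at the shared PostgreSQL database.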