htaccess 没有将索引页面重新路由到 prerender.io 缓存(但重新路由子页面)

htaccess not rerouting index page to prerender.io cache (but reroutes subpages)

这个有效: 例如。com/24

这不是: example.com

我不确定问题是在 htaccess 中还是在 apache conf 中。

.htaccess 代码

        <IfModule mod_headers.c>
            RequestHeader set X-Prerender-Token "xxxxxxxxxxxxx"
        </IfModule>

        <IfModule mod_rewrite.c>
            RewriteEngine on

            # Redirect www to non-www
            RewriteCond %{HTTP_HOST} ^www\.(.*) [NC]
            RewriteRule ^(.*)$ http://%1/ [R=301,L]

            Options +FollowSymLinks
            #RewriteRule ^api/(.*)$ http://vivule.ee/api/ [P,L]

            # Don't rewrite files or directories, but exclude adminer directory
            RewriteRule ^(adminer)($|/) - [L]
            RewriteCond %{REQUEST_FILENAME} -f
            RewriteRule ^ - [L]

            # Prerender.io stuff
            <IfModule mod_proxy_http.c>
                RewriteCond %{HTTP_USER_AGENT} Googlebot|bingbot|Googlebot-Mobile|Baiduspider|Yahoo|YahooSeeker|DoCoMo|Twitterbot|TweetmemeBot|Twikle|Netseer|Daumoa|SeznamBot|Ezooms|MSNBot|Exabot|MJ12bot|sogou\sspider|YandexBot|bitlybot|ia_archiver|proximic|spbot|ChangeDetection|NaverBot|MetaJobBot|magpie-crawler|Genieo\sWeb\sfilter|Qualidator.com\sBot|Woko|Vagabondo|360Spider|ExB\sLanguage\sCrawler|AddThis.com|aiHitBot|Spinn3r|BingPreview|GrapeshotCrawler|CareerBot|ZumBot|ShopWiki|bixocrawler|uMBot|sistrix|linkdexbot|AhrefsBot|archive.org_bot|SeoCheckBot|TurnitinBot|VoilaBot|SearchmetricsBot|Butterfly|Yahoo!|Plukkie|yacybot|trendictionbot|UASlinkChecker|Blekkobot|Wotbox|YioopBot|meanpathbot|TinEye|LuminateBot|FyberSpider|Infohelfer|linkdex.com|Curious\sGeorge|Fetch-Guess|ichiro|MojeekBot|SBSearch|WebThumbnail|socialbm_bot|SemrushBot|Vedma|alexa\ssite\saudit|SEOkicks-Robot|Browsershots|BLEXBot|woriobot|AMZNKAssocBot|Speedy|oBot|HostTracker|OpenWebSpider|WBSearchBot|FacebookExternalHit [NC,OR]
                RewriteCond %{QUERY_STRING} _escaped_fragment_

                # Only proxy the request to Prerender if it's a request for HTML
                RewriteRule ^(?!.*?(\.js|\.css|\.xml|\.less|\.png|\.jpg|\.jpeg|\.gif|\.pdf|\.doc|\.txt|\.ico|\.rss|\.zip|\.mp3|\.rar|\.exe|\.wmv|\.doc|\.avi|\.ppt|\.mpg|\.mpeg|\.tif|\.wav|\.mov|\.psd|\.ai|\.xls|\.mp4|\.m4a|\.swf|\.dat|\.dmg|\.iso|\.flv|\.m4v|\.torrent))(.*) http://service.prerender.io/http://vivule.ee/ [P,L]
            </IfModule>

            # Rewrite everything else to index.html to allow html5 state links
            RewriteRule ^adminer - [L,NC]
            RewriteRule ^ index.html [L]

        </IfModule>

我正在测试这个: https://developers.facebook.com/tools/debug/og/object/

问题网页: http://vivule.ee/

这可能会匹配您的主页并在预渲染配置为 运行

之前提供 index.html
RewriteCond %{REQUEST_FILENAME} -f
RewriteRule ^ - [L]

尝试将预呈现配置移到文件中更高的位置。

好的,我是这样解决的。事实证明问题出在 Apache 2.4 设置上,出于某种原因它绕过了预渲染服务器并从原始服务器提供原始 html。我通过在 htaccess 文件中添加 "DirectoryIndex" 解决了这个问题。没有添加参数,这会将页面索引设置为 http://example.com/.

这是我的最终代码:

        <IfModule mod_headers.c>
            RequestHeader set X-Prerender-Token "XXXXXXXXXXXX"
        </IfModule>

        <IfModule mod_rewrite.c>
            DirectoryIndex
            RewriteEngine on

            # Redirect www to non-www
            RewriteCond %{HTTP_HOST} ^www\.(.*) [NC]
            RewriteRule ^(.*)$ http://%1/ [R=301,L]

            Options +FollowSymLinks
            #RewriteRule ^api/(.*)$ http://vivule.ee/api/ [P,L]


            # Prerender.io stuff
            <IfModule mod_proxy_http.c>
                RewriteCond %{HTTP_USER_AGENT} Baiduspider|DoCoMo|Twitterbot|TweetmemeBot|Twikle|Netseer|Daumoa|SeznamBot|Ezooms|MSNBot|Exabot|MJ12bot|sogou\sspider|bitlybot|ia_archiver|proximic|spbot|ChangeDetection|NaverBot|MetaJobBot|magpie-crawler|Genieo\sWeb\sfilter|Qualidator.com\sBot|Woko|Vagabondo|360Spider|ExB\sLanguage\sCrawler|AddThis.com|aiHitBot|Spinn3r|BingPreview|GrapeshotCrawler|CareerBot|ZumBot|ShopWiki|bixocrawler|uMBot|sistrix|linkdexbot|AhrefsBot|archive.org_bot|SeoCheckBot|TurnitinBot|VoilaBot|SearchmetricsBot|Butterfly|Yahoo!|Plukkie|yacybot|trendictionbot|UASlinkChecker|Blekkobot|Wotbox|YioopBot|meanpathbot|TinEye|LuminateBot|FyberSpider|Infohelfer|linkdex.com|Curious\sGeorge|Fetch-Guess|ichiro|MojeekBot|SBSearch|WebThumbnail|socialbm_bot|SemrushBot|Vedma|alexa\ssite\saudit|SEOkicks-Robot|Browsershots|BLEXBot|woriobot|AMZNKAssocBot|Speedy|oBot|HostTracker|OpenWebSpider|WBSearchBot|FacebookExternalHit [NC,OR]
                RewriteCond %{QUERY_STRING} _escaped_fragment_

                # Only proxy the request to Prerender if it's a request for HTML
                RewriteRule ^(?!.*?(\.js|\.css|\.xml|\.less|\.png|\.jpg|\.jpeg|\.gif|\.pdf|\.doc|\.txt|\.ico|\.rss|\.zip|\.mp3|\.rar|\.exe|\.wmv|\.doc|\.avi|\.ppt|\.mpg|\.mpeg|\.tif|\.wav|\.mov|\.psd|\.ai|\.xls|\.mp4|\.m4a|\.swf|\.dat|\.dmg|\.iso|\.flv|\.m4v|\.torrent))(.*) http://service.prerender.io/http://vivule.ee/ [P,L]
            </IfModule>

            # Don't rewrite files or directories, but exclude adminer directory
            RewriteRule ^(adminer)($|/) - [L]
            RewriteCond %{REQUEST_FILENAME} -f
            RewriteRule ^ - [L]

            # Rewrite everything else to index.html to allow html5 state links
            RewriteRule ^adminer - [L,NC]
            RewriteRule ^ index.html [L]

        </IfModule>