# Extraction common issues
MarfeelPress uses the WordPress API to extract all the necessary content from the tenant.
The MarfeelPress API endpoints article explains which are those endpoints and what is their expected behavior.
# Section extraction
Section extraction relies on MarfeelPress Ripper's response to obtain the content. This endpoint is prepared to extract WordPress categories
and tags
. If the tenant uses custom taxonomies to group the content it will not be detected by MarfeelPress.
Some plugins use custom taxonomies and other ways to group articles that WordPress considers static pages. In Marfeel, they are extracted as static pages. This issue is solved either by updating that plugin's configuration to use the supported taxonomies, uninstalling it, or changing the Ripper to WhiteCollarRipper
.
WARNING
Before changing the ripper, escalate the issue to the Content-Platform chapter to validate there's not a more appropriate solution.
Another reason that can cause sections to fail is a bad LayoutDescriptor
configuration. Make sure all items have the required properties.
TIP
Manually request the content using the Ripper's route to validate the API response.
TIP
Figure out how to debug section extraction issues in the dedicated guide.
# Article extraction
Article extraction relies on MarfeelPress Extractor's response to obtain the content.
When an article fails to extract, entirely or partially, it means the content is not returned in the API response.
The main cause of this issue is installed plugins that modify the endpoint response, returning a malformed JSON
, changing the format of the response, or removing information from it.
To solve it, the conflicting plugin should be uninstalled. Alternatively, the extractor can be changed to BoilerpipeExtractor
to fetch the content directly to the HTML page, instead of via the WordPress API.
WARNING
Before changing the extractor, escalate the issue to the Content-Platform chapter to validate there's not a more appropriate solution.
TIP
Figure out how to debug article extraction issues in the dedicated guide.