TemEx: Template Extractor by Josep Silva
This tool automatically extracts the template of a webpage.
To identify the template, the tool (1) analyzes the webpages linked from the current webpage and (2) identifies the HTML structures they have in common. That common HTML structure is the template.
About this extension
A web template is a prepared HTML page in which the formatting is already implemented and the visual components are ready to receive content. Templates serve as a basis for composing new webpages that share a common look and feel. This helps web development because many tasks can be automated by reusing components. In fact, many websites are maintained automatically by code generators that produce webpages from templates. Templates are also good for users, who benefit from intuitive, uniform designs with a common vocabulary of colored and formatted visual elements.
This tool implements a novel method for automatic template extraction based on similarity analysis between the DOM trees of a collection of webpages, which are detected using menu information. The tool first identifies a set of webpages that potentially share the same template as the current webpage. These webpages are then analyzed, and their common HTML structure is returned as the web template.
The web template is shown to the user as a webpage. The tool can therefore be seen as a filter that removes the content of the webpage so that only the template remains.
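The idea of keeping only the structure shared by several DOM trees can be illustrated with a minimal Python sketch. This is not the add-on's actual algorithm: it assumes a simplified tag-only tree, matches nodes by tag name alone, and aligns children greedily in document order. The names Node, TreeBuilder, common, outline and extract_template are hypothetical and chosen only for this illustration.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """A tag-only DOM node (hypothetical helper; text and attributes are ignored)."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []


class TreeBuilder(HTMLParser):
    """Builds a simplified tag tree from an HTML string."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent


def parse(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def common(a, b):
    """Return the structure shared by trees a and b, or None if the roots differ."""
    if a.tag != b.tag:
        return None
    shared = Node(a.tag)
    start = 0  # children of b before this index are already matched or skipped
    for child_a in a.children:
        for i in range(start, len(b.children)):
            sub = common(child_a, b.children[i])
            if sub is not None:
                sub.parent = shared
                shared.children.append(sub)
                start = i + 1
                break
    return shared


def outline(node):
    """Render a tree as nested tag names, e.g. 'ul(li,li)'."""
    if not node.children:
        return node.tag
    return node.tag + "(" + ",".join(outline(c) for c in node.children) + ")"


def extract_template(pages):
    """Fold the common structure over all pages (assumes at least one page)."""
    trees = [parse(page) for page in pages]
    template = trees[0]
    for tree in trees[1:]:
        template = common(template, tree)
    return template


page_a = ("<html><body><div><ul><li>Home</li><li>News</li></ul></div>"
          "<p>Article A</p><div>footer</div></body></html>")
page_b = ("<html><body><div><ul><li>Home</li><li>News</li></ul></div>"
          "<table><tr><td>x</td></tr></table><div>footer</div></body></html>")

print(outline(extract_template([page_a, page_b])))
# prints: #document(html(body(div(ul(li,li)),div)))
```

In this example the menu and the footer appear in both pages and survive, while the page-specific paragraph and table are dropped. A realistic extractor would need a more robust node-matching criterion than tag names and a better alignment than the greedy one used here.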
More information about this add-on can be found at:
http://www.dsic.upv.es/~jsilva/retrieval/Web-TemEx
Permissions
This add-on needs to:
- Display notifications to you
- Access browser tabs
- Access your data for all websites
More information
- Version: 1.8.1
- Size: 53.08 KB
- Last updated: June 2, 2020
- License: Custom license