This is a JavaScript module for parsing HTML strings and converting them into structured objects. It uses the JSDOM library to create a DOM environment and then recursively parses HTML nodes and their children.
You can install this module using npm or yarn:
npm install html2obj-maciej-kolber
npm install --save-dev jest
npm start 'your-html-file'('markup.html')
Once installed, you can use the parseHTML
function in your JavaScript code as follows:
const parseHTML = require("html2obj-maciej-kolber");
const htmlString =
'<div style="background-color: yellow; font-size: 14px" id="first-div">Hello, friends <p class="para" style="font-family: monospace; font-size: 11px">Lorem ipsum dolor sit</p><footer style="width: auto; height: 100px; color: blue"><span>This is the end</span></footer><div>';
const parsedObject = parseHTML(htmlString);
console.log(parsedObject);
This function takes an HTML string as input and returns a structured object representing the parsed HTML. The object structure includes the following properties:
tag
: The HTML tag name (lowercased).id
: Theid
attribute of the element (if present).class
: Theclass
attribute of the element, stored as a string or an array if there are multiple classes.style
: An object containing CSS style properties and their values.text
: The text content of the element (only for top-level non-empty text nodes).children
: An array of child elements, each represented as a similar structured object.
If the HTML string does not contain a top-level <div>
element, the function returns null
.
Suppose you have the following HTML string:
<div style="background-color: yellow; font-size: 14px" id="first-div">
Hello, friends
<p class="para" style="font-family: monospace; font-size: 11px">
Lorem ipsum dolor sit
</p>
<footer style="width: auto; height: 100px; color: blue">
<span> This is the end </span>
</footer>
</div>
Using the parseHTML
function on this string will result in the following parsed object:
{
"tag": "div",
"id": "first-div",
"style": {
"background-color": "yellow",
"font-size": "14px"
},
"text": "Hello, friends",
"children": [
{
"tag": "p",
"class": "para",
"style": {
"font-family": "monospace",
"font-size": "11px"
},
"text": "Lorem ipsum dolor sit"
},
{
"tag": "footer",
"style": {
"width": "auto",
"height": "100px",
"color": "blue"
},
"children": [
{
"tag": "span",
"text": "This is the end"
}
]
}
]
}
This module is distributed under the MIT License. Feel free to use and modify it according to your needs.
Please note that this module uses the JSDOM library internally, and you should review its licensing terms as well.