JavaScript: parse a plain HTML string with DOMParser and change the link paths

In my Angular app, the WYSIWYG editor I use lets me insert links without a protocol, which is a problem.

I need to parse the string and prefix every link that has no protocol with http://.

This is what I tried:

var content = '<p>7</p><p>77</p><p><br></p><p><a href="http://example.com" rel="nofollow">http://example.com</a></p><p><br></p><p><a href="example.com" target="_blank">example.com</a></p><p><br></p><p><a href="ftp://localhost">ftp://localhost</a></p><p><br></p><p><a href="localhost">localhost</a><br></p>';

var addProtocolToLinks = function(URL){
    var protocols = ['http', 'https', 'ftp', 'sftp', 'ssh', 'smtp'];
    var withProtocol = false;
    if (URL.length > 0){
      protocols.forEach(function(el) {
        if (URL.slice(0,4).indexOf(el) > -1){
          withProtocol = true;
        }
      });
      var newURL =  URL;
      if (!withProtocol){
        newURL = 'http://' + URL;
      }
      console.log(newURL + '   ' + URL);
      return newURL;
    }
};

var parser = new DOMParser();
var doc = parser.parseFromString(content, "text/html");
var links = doc.getElementsByTagName("a");
for(var i=0; i<links.length; i++) {
    links[i].setAttribute('href', addProtocolToLinks(links[i].href));
    console.log('result: ' + links[i].getAttribute('href'));
}

console.log('result html: ');
console.log(doc);  // I also need only my content part back, without html, body, etc.

http://jsfiddle.net/r3dgeo23/

But for some reason it doesn't work properly. What am I doing wrong?

brabertaser19 asked Jul 21 '15 11:07



2 Answers

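You had almost everything right, except that:

links[i].href

does not return the literal attribute text. The href property gives the URL as the browser resolves it, so for a link without a protocol you get a value resolved against the document's base URL (or, depending on the browser, an empty string) rather than 'example.com'. Your function addProtocolToLinks therefore never sees the original value and cannot fix it.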

You can use:

getAttribute('href');

to make it work, see this fiddle: http://jsfiddle.net/r3dgeo23/3/
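
A minimal sketch of the corrected loop, assuming the same protocol list as in the question. The names ensureProtocol and fixLinks are made up for illustration; the only real change is reading the raw value with getAttribute('href') instead of the href property.

```javascript
// Hypothetical helper: prefix "http://" unless the value already starts
// with one of the protocols listed in the question.
function ensureProtocol(rawHref) {
  var hasProtocol = /^(http|https|ftp|sftp|ssh|smtp):\/\//i.test(rawHref);
  return hasProtocol ? rawHref : 'http://' + rawHref;
}

// Rewrite every link from its literal attribute text, not the resolved
// .href property. Links without an href attribute are skipped.
function fixLinks(links) {
  for (var i = 0; i < links.length; i++) {
    var raw = links[i].getAttribute('href');
    if (raw) {
      links[i].setAttribute('href', ensureProtocol(raw));
    }
  }
}

// Browser-only usage (DOMParser is not available in plain Node.js).
if (typeof DOMParser !== 'undefined') {
  var parsed = new DOMParser().parseFromString(
    '<p><a href="example.com">example.com</a></p>', 'text/html');
  fixLinks(parsed.getElementsByTagName('a'));
  console.log(parsed.getElementsByTagName('a')[0].getAttribute('href'));
}
```

Skipping links with an empty or missing href avoids writing the string "undefined" into the attribute, which can happen when the normalizer returns nothing for empty input.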

/////EDIT

Here is a fiddle for only fetching the content part and not the whole html: http://jsfiddle.net/r3dgeo23/5/
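
The fiddle itself is not reproduced here, so the following is only one possible way to get the fragment back, not necessarily what the fiddle does. It assumes a browser, and rewriteLinks with its rewrite callback are hypothetical names. For markup like the question's, the parser places the fragment inside body, so serializing doc.body.innerHTML returns it without the html and body wrapper.

```javascript
// Hypothetical helper: parse an HTML fragment, rewrite each link's raw href,
// and return only the fragment (no <html>/<head>/<body> wrapper).
function rewriteLinks(html, rewrite) {
  var doc = new DOMParser().parseFromString(html, 'text/html');
  var links = doc.getElementsByTagName('a');
  for (var i = 0; i < links.length; i++) {
    var raw = links[i].getAttribute('href');
    if (raw !== null) {
      links[i].setAttribute('href', rewrite(raw));
    }
  }
  // Serialize only the children of <body>.
  return doc.body.innerHTML;
}

// Browser-only usage; the guard keeps the snippet harmless elsewhere.
if (typeof DOMParser !== 'undefined') {
  console.log(rewriteLinks(
    '<p><a href="example.com">example.com</a></p>',
    function (href) {
      // Assumption for this demo: anything shaped like "scheme://" has a protocol.
      return /^[a-z]+:\/\//i.test(href) ? href : 'http://' + href;
    }
  ));
}
```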

/////EDIT2

Create the container with a unique id inside your function:

var container = document.createElement('div');
container.setAttribute("id", "content");
container.innerHTML = content;

http://jsfiddle.net/r3dgeo23/6/

Jonny Vince answered Oct 06 '22 03:10



If I completely understood your question, this should work...

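    function jsF_addHTTP( url )
    {
        // Leave empty or non-string values (e.g. a missing href) untouched.
        if (typeof url === "string" && url !== "")
        {
            // Prepend "http://" unless the URL already starts with a known protocol.
            if ( !/^(http|https|ftp|sftp|ssh|smtp):\/\//i.test(url) )
            {
                url = "http://" + url;
            }
        }
        return url;
    }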
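
As a quick check, the sketch below runs the same test over the four href values from the question. It restates the function compactly so that the snippet runs on its own; the hrefs array is copied from the question's content string.

```javascript
// Compact restatement of the check above, for a standalone run.
function jsF_addHTTP(url) {
  if (typeof url === 'string' && url !== '' &&
      !/^(http|https|ftp|sftp|ssh|smtp):\/\//i.test(url)) {
    return 'http://' + url;
  }
  return url;
}

// The four href values that appear in the question's HTML string.
var hrefs = ['http://example.com', 'example.com', 'ftp://localhost', 'localhost'];
var fixed = hrefs.map(jsF_addHTTP);
console.log(fixed);
// → ["http://example.com", "http://example.com", "ftp://localhost", "http://localhost"]
```

Note that the pattern only recognizes the listed scheme:// forms, so a value such as mailto:someone@example.com would also get http:// prefixed. Extend the list if the editor can produce such links.
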
Devang Mistry answered Oct 06 '22 02:10
