Have you ever creating offline archive of a website that you like? Which “offline downloader” did you use? Hopefully this post will give you rough idea on how to traverse the website ( and do whatever you want to do with the content ). My idea of traversing a website represented in the following steps:
- Define the startup page
- Collect all hyper-links that have not been visited and put it into queue
- Do something with the content
- Dequeue a hyper-link and return to step 2
- Repeat until no more links in the queue.
I have build my own prototype that suit my needs. I might need further tweaks and improvements before releasing it to the public. My main goal would be adding functionality to define characteristics of the page that will be traversed in step 2, and functionality of what you can do with the content of the page. Stay tuned for further updates.
About Hardono
Howdy! I'm Hardono. I am working as a
Software
Developer. I am working mostly in Windows, dealing with .NET, conversing in C#. But I know a bit of Linux, mainly because I need to keep this blog operational. I've been working in Logistics/Transport industry for more than 11 years.
I was reading my RSS Bandit (thanks to Kang Kombor) when I bumped an article about classification of blog posts. I will write down the excerpt here:
- The SpeedLinker
- This is where you gather links to something interesting in other blogs.
- The Quoter
- This is when you saw other people saying good things in their blog and you rewrite it in your own blog.
- The Entertainer
- This is where you entertain your readers by putting hilarious video from Youtube.com
- The Questioner
- This is where you ask your readers a question to start a discussion. You definitely need readers to begin with 🙂
- The Updater
- This is where you inform your readers regarding the progress of an ongoing project. It could be report on your own project/blog
- The Newsreader
- This is where you share interesting news you get from press release or newspaper.
- The Recycler
- This is where you put your old post as new, with additional update of course
- The Guest Poster
- This is where you let someone else creating post in your blog.
- The Announcer
- This is where you announce something to your readers. DOH!
- Source
- The 9 Essential Posts that Every Blogger Should Know
And I think this post itself will become the example of ‘The Quoter’ type of post.
About Hardono
Howdy! I'm Hardono. I am working as a
Software
Developer. I am working mostly in Windows, dealing with .NET, conversing in C#. But I know a bit of Linux, mainly because I need to keep this blog operational. I've been working in Logistics/Transport industry for more than 11 years.
After frustated unable to get my source codes displayed smaller, I decided to play around with CSS’ font-size for our amusement.
Done using Javascript:
<script language="javascript">
function UpdateFont()
{
var objTarget = document.getElementById('divtarget'); // a div
var objSourceFont = document.getElementById('txbFont'); // a textbox
var objSourceSize = document.getElementById('txbSize'); // a textbox
if (objTarget != null && objSourceFont != null && objSourceSize != null)
{
objTarget.style.fontSize = objSourceSize.value;
objTarget.style.fontFamily = objSourceFont.value;
}
}
</script>
- Resources:
- W3C Standard
W3School Tutorial -
About Hardono
Howdy! I'm Hardono. I am working as a
Software
Developer. I am working mostly in Windows, dealing with .NET, conversing in C#. But I know a bit of Linux, mainly because I need to keep this blog operational. I've been working in Logistics/Transport industry for more than 11 years.