Warning: strpos(): needle is not a string or an integer in /home/allisona/public_html/kcocco/index.php on line 31

Warning: strpos(): needle is not a string or an integer in /home/allisona/public_html/kcocco/index.php on line 37
KCOCCO ~ photo
Christmas Eve Dinner 2006 

Jordan and Dad

Baccala (Salted Cod) Carpaccio & BBQ Shrimp



Ethan and huge crab leg

  |  [ 0 trackbacks ]   |  permalink  |  related link
Ocean City, New Jersey, Coccos 



  |  [ 0 trackbacks ]   |  permalink  |  related link
India, Darjeeling I 

Darjeeling city from the balcony of the TV tower hotel...cheap, about $2.25 per night and good restaurant.

Spice shop. Many locals have the shop keeper put together a small mix of spices for one meal.

The shop keeper cutting fish with a fixed blade in the middle of the little road side shop

Goat heads and a hook full of something for sale!

Get your veggies and hair cut within 1 foot.

  |  [ 0 trackbacks ]   |  permalink  |  related link
India, Dehli 

The traditional traveling ear cleaner working on Younis. Looked so unfun that I passed on my cleaning.

Qutb tower

Imam Zamim Tomb

Qutb arch

Lotus Temple

Humayuns Tomb

Anoop Hotel Dehli

Traditional Kashmir dinner with Mudasar & Younis. Lamb: shank, kabab, ball with yogurt, ball with red something and rice.
  |  [ 0 trackbacks ]   |  permalink  |  related link
Perl Stripping 
Over the holiday Dad and I hacked a perl script that parses HTML pages from FoodNetwork.com. The goal was to get the episode numbers & names for Molto Mario recipes. Our first step was to save the index page of Mario recipes from FoodNetwork.com . We then used the Unix command wget to search the page for URL links and save the associated HTML pages into one directory. We then created a Perl script that opens each of the HTML files found in the directory and pulls out the data we needed.
Here is an example of the output(episode number, name, recipe name) now ready to load into a database:
MB2G15~Trastevere On a Sunday~Gnudi con Fiori di Zucca
MB2G15~Trastevere On a Sunday~Chicken with Sweet Peppers: Pollo con Peperoni
MB2G17~Antica Bessetta~Bigoli - Basic Recipe
MB2G17~Antica Bessetta~Bigoli Bianchi with Duck Ragu
MB2G17~Antica Bessetta~Bigoli Scuri

Here is the general script that can easily be altered to parse other shows:
#!/usr/bin/perl
####################################################################
# recipestrip.pl
# Text stripping script. Used on Foodnetwork html pages.
# 3.28.2005 K & L Cocco
#
# Program used to capture episode number & title and recipe tiles
# from html files.
# The raw html files were gathered from foodnetwork.com with wget command
# using flags -i and -F.
# example: wget -F -i capturedhtmlfile.html
#
####################################################################
use Getopt::Long;
use File::Basename;

sub trimwhitespace($)
{
my $string = shift;
$string =~ s/^s+//;
$string =~ s/s+$//;
return $string;
}

$path="/video1/mario/shows/";

opendir(SHOWS, "$path");
@Allnames = readdir(SHOWS);

open (koutputfile, ">episoderecipelist.txt") :: die ("Could not open file. $!");
foreach $Name (@Allnames) {
if (-d $Name) {next};
($show, $path, $suffix) = fileparse($Name, ".html");
if ($suffix ne ".html") {next};
#print $Name,"n";

open (inputfile, $Name) :: die ("Could not open file. <br> $!");

foreach $text (<inputfile>){
# chomp $text;
#print $text;
if ($text =~ /articleshowname/){
$stopsl = index($text,"Episode ")+8;
$lensl = index($text,"</SPAN><P") - $stopsl;
$episodenum = substr($text,$stopsl,$lensl);
#print substr($text,$stopsl,$lensl),"n";
print $episodenum,"n";
# print $text,"n";
#print "stopsl: ",$stopsl, " lensl: ",$lensl,"n";
#print "*****************************************n";
}
if ($text =~ /episodename/){
$stopsl = index($text,"name'>")+6;
$lensl = index($text,"</SPAN") - $stopsl;
$episodename = substr($text,$stopsl,$lensl);
#print substr($text,$stopsl,$lensl),"n";
#print $episodename,"n";
# print $text,"n";
#print "stopsl: ",$stopsl, " lensl: ",$lensl,"n";
#print "******************************************n";
}
if ($text =~ /recipes/recipe/){
$startsl = index($text,"html'>")+6;
$stopsl = index($text, "</a></TD>");
# captures lines with no ending tag
if ($stopsl == -1) {
$stopsl = length($text);
}
$lensl = $stopsl - $startsl;
print koutputfile trimwhitespace($episodenum)."~".trimwhitespace($episodename)."~".trimwhitespace(substr($text,$startsl,$lensl)),"n";
#print $Name,"n";
#print "stopsl: ",$stopsl, " lensl: ",$lensl,"n";
#print "******************************************";
}
}
close (inputfile);
}
print "*** Run completion ***n";
close (koutputfile);
  |  [ 0 trackbacks ]   |  permalink  |  related link

Back Next

// Google Analytics