Friday, June 29, 2012
Sunday, May 13, 2012
Hadoop - Forrester Report
Forrester claims AWS is feature rich. I don't know how; need to check it out.
Hadoop modelling: MapReduce (MR), Pig, Mahout
Storage and data management: HDFS, HBase, Cassandra
Data warehousing, summarization, query: Hive, Sqoop
Data collection / aggregation / analysis: Chukwa, Flume
Metadata / table / schema: HCatalog
Cluster management / job scheduling / workflow: ZooKeeper, Oozie, Ambari
Hadoop data serialization: Avro
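The grouping above can also be kept as a small lookup table, which makes it easy to ask which category a given tool falls under. This is only a minimal Python sketch: the names `ECOSYSTEM` and `category_of` are mine (hypothetical), and the categories and tools are just the ones listed above, nothing more.

```python
# Categories and tools as listed in the notes above ("MR" spelled out as MapReduce).
ECOSYSTEM = {
    "modelling": ["MapReduce", "Pig", "Mahout"],
    "storage and data management": ["HDFS", "HBase", "Cassandra"],
    "data warehousing, summarization, query": ["Hive", "Sqoop"],
    "data collection, aggregation, analysis": ["Chukwa", "Flume"],
    "metadata, table, schema": ["HCatalog"],
    "cluster management, job scheduling, workflow": ["ZooKeeper", "Oozie", "Ambari"],
    "data serialization": ["Avro"],
}


def category_of(tool):
    """Return the category a tool is listed under, or None if it is not listed."""
    wanted = tool.lower()
    for category, tools in ECOSYSTEM.items():
        # Case-insensitive match so "hive" and "Hive" both work.
        if wanted in (name.lower() for name in tools):
            return category
    return None
```

Calling `category_of("hive")` walks the table and returns the matching category name; a tool that is not in the notes simply gives `None`.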
Good news: Hadoop is full of immature products.
Hadoop products:
MapR: strong OEM, feature rich (why feature rich?)
Hortonworks: efficient ecosystem, everything
Pentaho: BI tool for big data, works on third-party Hadoop clusters
Datameer: Hadoop modelling tool (a modelling tool for a nonrelational database... LOL)
Companies involved with Hadoop:
Composite Software, Concurrent, Dell, Endeca, Hadapt, HP/Vertica, Informatica, NetApp, Microsoft, Oracle, Pervasive, Platfora, Quest Software, RainStor, SAP/Sybase, SGI, and StackIQ. Also Karmasphere and Teradata/Aster Data.
Forrester: HStreaming, Outerthought, Zettaset, Platform Computing, Pentaho, Datameer, DataStax, Hortonworks, Cloudera, MapR, Greenplum, EMC, IBM, AWS
Friday, April 20, 2012
Compare Apache Wicket vs Vaadin
If you don't want to read the whole forum thread, the short version is that Wicket has more advantages than Vaadin. Vaadin has the advantage of beautiful themes, but Wicket can integrate with GWT.
Vaadin is better if you just want to stick with Java and are not interested in CSS or JavaScript. But if your resume says you are a web developer and you are not good at CSS or JavaScript, that automatically counts against you, so Vaadin is no advantage there. Wicket, on the other hand, requires some skill in imagining the GUI. Designing a GUI is a useful skill for developers too: it does not need to be flashy, but a genuinely useful GUI rocks.
If a desktop look and feel is needed, then Vaadin is good, as I already suggested, but Wicket is no less capable: you can use GWT to create that kind of look and feel.
Thursday, March 8, 2012
datasets
This site links to various datasets across the federal government.
www.Data.gov -- repository for data and geodata from across the federal government, searchable by agency, data type, and keyword, among others. Many, but not all, Commerce data sets reside here.
Data.gov Commerce data and tools can be found at:
Data
http://www.data.gov/list/agency/1/0/catalog/raw/page/1/count/50
Tools
http://www.data.gov/list/agency/1/0/catalog/tools/page/1/count/50
Census Data – Population, household, housing, demographics. Economic census and monthly indicators, foreign trade, employment statistics, maps, geographic data and more. http://www.census.gov/
http://www.census.gov/main/www/access.html
List of Open Data Sites Around the World: http://www.data.gov/opendatasites
Amazon Listing of Public Data Sets: http://aws.amazon.com/datasets
Washington DC: http://data.octo.dc.gov
New York: http://nyc.gov/data
San Francisco: http://datasf.org/
U.K.: http://data.gov.uk/
http://www.cs.cmu.edu/~awm/10701/project/data.html
Wednesday, January 4, 2012
Convert Excel to CSV
- Convert Excel to CSV
- If the workbook has multiple sheets, save each one as <file name>.<sheet name>.csv
- Convert all the files in the specified directory (e.g. C:\)
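' Folder to scan; adjust as needed.
Const SOURCE_DIR = "C:\drop\MarketShare\2010"
Const xlCSV = 6 ' Excel file-format code for CSV

Dim oFSO
Dim oShell, oExcel, oFile, oSheet
Set oFSO = CreateObject("Scripting.FileSystemObject")
Set oShell = CreateObject("WScript.Shell")
Set oExcel = CreateObject("Excel.Application")
oExcel.DisplayAlerts = False ' suppress overwrite and format prompts
For Each oFile In oFSO.GetFolder(SOURCE_DIR).Files
    If LCase(oFSO.GetExtensionName(oFile.Path)) = "xls" Then
        WScript.Echo oFile.Path
        ' Open read-only, without updating links or adding to the recent-files list
        With oExcel.Workbooks.Open(oFile.Path, 0, True, , , , True, , , , False, , False)
            For Each oSheet In .Worksheets
                ' One CSV per sheet, named from the workbook file name and the sheet name
                oSheet.SaveAs SOURCE_DIR & "\" & oFile.Name & "." & oSheet.Name & ".csv", xlCSV
            Next
            .Close False, , False ' close without saving changes to the workbook
        End With
    End If
Next
oExcel.Quit
oShell.Popup "Conversion complete", 10 ' popup closes by itself after 10 seconds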
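The naming rule used by the script above (folder, then the workbook file name, the sheet name, and a .csv suffix) can be sketched in Python to preview the output paths before running the conversion. This is a rough sketch and not part of the original script: `csv_name_for`, `preview`, and the sample workbook and sheet names are hypothetical. Note that the workbook's own extension stays in the result, because the script concatenates the full file name.

```python
def csv_name_for(folder, workbook_name, sheet_name):
    """Mirror the script's concatenation: folder, workbook name, sheet name, .csv."""
    return folder + "\\" + workbook_name + "." + sheet_name + ".csv"


def preview(folder, sheets_by_workbook):
    """Return the CSV paths the script would write, one per sheet."""
    paths = []
    for workbook_name, sheet_names in sheets_by_workbook.items():
        # Only .xls files are converted, matching the case-insensitive check in the script.
        if not workbook_name.lower().endswith(".xls"):
            continue
        for sheet_name in sheet_names:
            paths.append(csv_name_for(folder, workbook_name, sheet_name))
    return paths


if __name__ == "__main__":
    # Hypothetical folder contents: one workbook with two sheets, one unrelated file.
    sample = {"sales.xls": ["Q1", "Q2"], "notes.txt": ["Sheet1"]}
    for path in preview("C:\\drop\\MarketShare\\2010", sample):
        print(path)
```

If the doubled extension in names like sales.xls.Q1.csv is unwanted, the VBScript could use oFSO.GetBaseName(oFile.Path) in place of oFile.Name.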