first_imgHow to Write a Welcome Email to New Employees? Tags: #Big Data#hack Growing Phone Scams: 5 Tips To Avoid Related Posts is a personal finance website with a mission to “help US consumers make smarter decisions with their money”. What really makes it stand out is the company’s unique access to detailed, anonymized transaction histories from 20 million Citibank credit cards. This allows them to build consumer tools in the same vein as Mint, but with a deep foundation of information to compare to right from the first user. Last week I sat down with CEO Jaidev Shergill, CTO Phil Kim and data scientist Alex Hasha to learn more about what they’re doing with such a powerful data set.The first question they wanted to address was the obvious one of how do they ensure privacy and security when dealing with such sensitive information? Everything is held in a secure data center, and no direct personally identifiable information is included in the histories – everything’s anonymized. The team also takes further steps, like identifying and removing healthcare-related payments. I asked them though, doesn’t it still make people a bit uncomfortable? Their response was that their whole business was based around helping consumers, and their investor Citi only shares the data on very strict conditions because they believe Bundle’s work will make customers lives better. CTO Phil Kim laid out their philosophy:Bundle takes great pains to protect the privacy of users. First, we hold ourselves to strict, bank-level information security standards, which means that sensitive data is held in a secure data center and access is heavily restricted, and that the Bundle application is heavily scrutinized for vulnerabilities on a regular basis, to prevent accidental or malicious leakage of user data. Second, much of the data analysis and synthesis work we do relies on data that has been sampled, modeled, flattened, or otherwise transformed — we rarely work with raw transaction data, and we never work with data that has a direct linkage back to a named customer. Last but not least, Bundle is very much focused on building tools to help consumers — remember that this data is not new… large companies use data just like this to market products and make business decisions — Bundle is simply trying to share this data with consumers.Alex Hasha described how he’d worked in the finance industry as a quant, working in a team of over two hundred PhDs to analyze financial instruments. The attraction of for him was the chance to work on something that offered direct benefits to ordinary users, a refreshing change from the abstract world of high finance. Users upload information from their own bank and credit card accounts onto the site, and in return they get back a score card showing how their spending compares to people like themselves. For example, you might discover that you’re spending a lot more on groceries than other people in your neighborhood, and you’d be better off switching to a cheaper supermarket.The key to all of their work is the value that they’re able to extract from aggregate information, things like how much people in a particular zip code spend on particular categories such as eating out, groceries and transportation. Because this is the result of blending and averaging large numbers of different accounts, it helps reduce the risk that any sensitive information will leak out. What’s really impressive about the data they possess is its broad coverage. Almost every merchant in the U.S. will be represented, and it has the potential to offer the deep customer analytics that website publishers are used to. I could imagine it being used by restaurant owners to spot when they’re losing previously loyal customers for example. won’t speculate on where they will take their product in the future, but did want to emphasize how everything was driven by their mission to help consumers.I spent a bit of time talking with them about the technical challenges of their work, too. Credit card systems are often 30 or 40 years old, and so the data they get back is often very messy. You know how you look at your statements and try to decipher what “MCDON 94117” could be? That’s one of their biggest obstacles, the names of the merchants are often incomplete and unclear, so they have a whole system devoted to making sense of this unstructured data. “MCDON”, “MCD” and “MCDONALDS” all likely to refer to the restaurant, which allows them to categorize any transactions as food purchases. A large amount of their code is written in Perl, since they’re big fans of CPAN’s rich repository of libraries, and runs in-memory, so it’s not a classic big data problem. They also rely on R for some of their analysis, thanks to its rich toolkit of statistical functions.The data mining of billions of credit card transactions is bound to raise a lot of questions, but it was clear to me that is serious in its mission to help consumers. Its product certainly seems to offer a lot more value to the wider world than anything that Wall Street’s quants have produced.center_img 7 Types of Video that will Make a Massive Impac… Why You Love Online Quizzes pete wardenlast_img read more

first_imgThe Uttar Pradesh government on Sunday shunted out the District Magistrate and the Superintendent of Police of Sonbhadra, besides ordering action against 13 other officials after they were indicted in an inquiry into the killing of 10 Gond tribals last month over a land dispute.‘FIRs against guilty’ Addressing a press conference at his residence, Chief Minister Yogi Adityanath said FIRs will be registered against several police and administration officials for alleged irregularities and members of the Adarsh Krishi Sahkari Samiti, Umbha, on charges of land grabbing. The disputed land in Umbha and Saphi villages will also be transferred back and registered in the name of the gram sabha, he said, while announcing that a Special Investigation Team will look into the matter. Mr. Adityanath said departmental proceedings have been initiated against Sonbhadra District Magistrate Ankit Kumar Agrawal and Superintendent of Police Salman Taj Patil for taking “one-sided decision” against the villagers. Directives have been issued to attach Mr. Agrawal and Mr. Patil to the Personnel Department and the DGP Headquarters, respectively, he said. S. Ramalingam has been made the new District Magistrate of Sonbhadra, while Prabhakar Chaudhary is the new Superintendent of Police, officials said. “The entire matter will be probed by an SIT. The SIT will be headed by DIG SIT J. Ravindra Gaud and will have Additional SP Amrita Mishra along with three inspectors. DG SIT R.P. Singh will be monitoring the work of the SIT,” the Chief Minister said. He said another team will be set up under Additional Chief Secretary (Revenue) Renuka Kumar to look into the issue of land grabbing by “fake” societies in the last 60-70 years in Mirzapur and Sonbhadra.‘No action on report’ “Fake societies in Sonbhadra and Mirzapur have grabbed more than one lakh acre of land. In 1972, the then Chief Minister Hemwati Nandan Bahuguna had constituted a probe committee under Mangaldev Visharad. However, no action was taken as a number of Congress leaders were involved,” Mr. Adityanath said.last_img read more