Brazilian E-Commerce Public Dataset by Olist
Welcome! This is a Brazilian e-commerce public dataset of orders made at Olist Store. The dataset has information on 100k orders from 2016 to 2018 made at multiple marketplaces in Brazil. Its features allow viewing an order from multiple dimensions: from order status, price, payment and freight performance to customer location, product attributes and finally reviews written by customers. We also released a geolocation dataset that relates Brazilian zip codes to lat/lng coordinates.
This is real commercial da..
DESCRIPTION
All TV Show details from IMDB
SUMMARY
Context
This dataset was created by our in-house teams at PromptCloud (https://www.promptcloud.com/) and DataStock (https://datastock.shop/). It contains a sample of 5K records.
Content
This dataset contains the following.
Acknowledgements
We wouldn't be here without the help of our web scraping and data mining experts at PromptCloud and DataStock.
Inspiration
The inspiration for this dataset was made keeping in mind the data analysts and researchers across ..
Sales Prediction for Big Mart Outlets
The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim is to build a predictive model and predict the sales of each product at a particular outlet.
Using this model, BigMart will try to understand the properties of products and outlets which play a key role in increasing sales.
Please note that the data may have missing values as some stores might not report all the data due ..
Description
The two datasets freMTPLfreq and freMTPLsev collect risk features for 413,169 motor third-party liability policies (observed mostly over one year), together with the number of claims per policy and the corresponding claim amounts. freMTPLfreq contains the risk features and claim counts, whilst freMTPLsev contains claim amounts. The two tables can be linked via the corresponding policy ID.
Additional information can be found at http://cas.uqam.ca/pub/web/CASdatasets-manual.pdf.
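Linking the two tables on the policy ID can be sketched as below. This is a minimal illustration using only the Python standard library, not the dataset authors' method. The column names (PolicyID, ClaimNb, ClaimAmount) and all values are assumptions made for the example; check them against the manual before use.

```python
from collections import defaultdict

# Toy rows standing in for the two tables. Column names and values are
# assumptions for illustration, not taken from the real data.
freq_rows = [
    {"PolicyID": 1, "ClaimNb": 0},
    {"PolicyID": 2, "ClaimNb": 2},
    {"PolicyID": 3, "ClaimNb": 1},
]
sev_rows = [
    {"PolicyID": 2, "ClaimAmount": 300.0},
    {"PolicyID": 2, "ClaimAmount": 150.0},
    {"PolicyID": 3, "ClaimAmount": 1000.0},
]


def total_claims_by_policy(freq, sev):
    """Left-join claim amounts onto policies: one output row per policy."""
    # Sum claim amounts per policy ID (a policy may have several claims).
    totals = defaultdict(float)
    for row in sev:
        totals[row["PolicyID"]] += row["ClaimAmount"]
    # Policies without any claim get a total of 0.0.
    return [
        {**row, "TotalClaimAmount": totals.get(row["PolicyID"], 0.0)}
        for row in freq
    ]


linked = total_claims_by_policy(freq_rows, sev_rows)
for row in linked:
    print(row)
```

Aggregating the claim amounts before joining keeps one row per policy, which avoids duplicating risk features for policies with several claims.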
A health indicator is a measure designed to summarize information about a given priority topic in population health or health system performance. Health indicators provide comparable and actionable information across different geographic, organizational, or administrative boundaries and/or can track progress over time.
Context
In the dataset freMTPL2freq, risk features and claim numbers were collected for 677,991 motor third-party liability policies (observed over one year).
Content
freMTPL2freq contains 11 columns (+IDpol):
• IDpol: The policy ID (used to link with the claims dataset).
• ClaimNb: Number of claims during the exposure period.
• Exposure: The exposure period.
• Area: The area code.
• VehPower: The power of the car (ordered categorical).
• VehAge: The vehicle age, in years.
• DrivAge: The driver age, in years (in France, people can drive a car at 18).
• BonusMalus ..
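As an illustration of how ClaimNb and Exposure are typically used together, the sketch below computes an empirical claim frequency (total claims divided by total exposure). It assumes Exposure is expressed in years, which the column list above does not state, and it runs on made-up toy rows rather than the real data.

```python
def claim_frequency(policies):
    """Empirical claim frequency: total claims divided by total exposure."""
    total_claims = sum(p["ClaimNb"] for p in policies)
    total_exposure = sum(p["Exposure"] for p in policies)
    if total_exposure == 0:
        raise ValueError("total exposure is zero")
    return total_claims / total_exposure


# Toy rows; values are invented and Exposure is assumed to be in years.
toy_policies = [
    {"IDpol": 1, "ClaimNb": 0, "Exposure": 1.0},
    {"IDpol": 2, "ClaimNb": 1, "Exposure": 0.5},
    {"IDpol": 3, "ClaimNb": 1, "Exposure": 0.5},
]

freq = claim_frequency(toy_policies)
print(freq)
```

Weighting by exposure matters because a policy observed for half a year has had half the opportunity to produce a claim.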
Context
Now that this year's IPL is over, let's not curb our love of cricket: start analyzing the whole of the IPL with this latest and complete Indian Premier League dataset. It contains match descriptions, results, winners, players of the match, ball-by-ball data and much more. So stop thinking and start analyzing.
Content
This dataset consists of two separate CSV files: matches and deliveries. These files contain the summary of each match and the ball-by-ball details, respectively.
Acknowledgements
Data Source: Cricsheet
The training data contain transaction history for customers that ended up purchasing a policy. For each customer_ID, you are given their quote history. In the training set you have the entire quote history, the last row of which contains the coverage options they purchased.
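Extracting the purchased row for each customer can be sketched as follows. The example assumes rows are already in chronological order within each customer; the columns shopping_pt and option are hypothetical names used for illustration, and only customer_ID comes from the description above.

```python
def purchased_rows(quote_history):
    """Return {customer_ID: last row seen}, i.e. the purchase row.

    Assumes rows are sorted chronologically within each customer.
    """
    last = {}
    for row in quote_history:
        # Later rows overwrite earlier ones, so the final row survives.
        last[row["customer_ID"]] = row
    return last


# Toy quote history; shopping_pt and option are hypothetical columns.
history = [
    {"customer_ID": 10, "shopping_pt": 1, "option": "A"},
    {"customer_ID": 10, "shopping_pt": 2, "option": "B"},
    {"customer_ID": 11, "shopping_pt": 1, "option": "C"},
    {"customer_ID": 10, "shopping_pt": 3, "option": "B"},
]

purchases = purchased_rows(history)
print(purchases[10]["shopping_pt"], purchases[11]["option"])
```

If the file is not sorted, sort it by customer and time first; otherwise the last row seen is not necessarily the purchase.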
What is a customer?
Each customer has many shopping points, where a shopping point is defined by a customer with certain characteristics viewing a product and its associated cost at a particular time.
• Some customer characteristics may change over time (e.g. as the customer ch..
DESCRIPTION
Dataset of 22,000 fashion products on Amazon
SUMMARY
About this Dataset
This is a pre-crawled dataset, taken as a subset of a bigger dataset (more than 7 million fashion products) that was created by extracting data from Amazon.
Objectives
Analyses of the ratings, price and reviews can be performed.
Background
This dataset was created by PromptCloud's in-house web-crawling service.
Context
Gaming is a very big industry now. Every year millions of dollars are invested in esports, and many new companies want to invest in the esports scene. One of the biggest deals ever was when Mixer launched and brought Ninja and Shroud over to its platform from Twitch. But Twitch has been home to streamers since day one, and now that Mixer has shut down, streamers are returning to the platform. Millions, if not billions, watch Twitch streams every day, and I myself like to watch Twitch streams. So I put together Top 1000 Streamers from ..