Brazilian E-Commerce Public Dataset by Olist
Welcome! This is a Brazilian ecommerce public dataset of orders made at Olist Store. The dataset has information of 100k orders from 2016 to 2018 made at multiple marketplaces in Brazil. Its features allows viewing an order from multiple dimensions: from order status, price, payment and freight performance to customer location, product attributes and finally reviews written by customers. We also released a geolocation dataset that relates Brazilian zip codes to lat/lng coordinates.
This is real commercial da..
There is universal acceptance of statistics as an essential tool for all types of research. That acceptance and ever-proliferating areas of research specialization have led to corresponding increases in the number and diversity of available statistical procedures. In agricultural research, for example, there are different statistical techniques for crop and animal research, for laboratory and field experiments, for genevic and physiological research, and so on. Although this diversit" indicates the aailability of appropriate statistical techniques for mo..
DESCRIPTION
All TV Show details from IMDB
SUMMARY
Context
This dataset was created by our in house teams at PromptCloud (https://www.promptcloud.com/) and DataStock (https://datastock.shop/). This dataset contains a sample of 5K records in it.
Content
This dataset contains the following.
Acknowledgements
We wouldn't be here without the help of our web scraping and data mining experts at PromptCloud and DataStock.
Inspiration
The inspiration for this dataset was made keeping in mind the data analysts and researchers across ..
Healthcare Data Sources and Basic Analytics: These chapters discuss the details about the various healthcare data sources and the analytical techniques that are widely used in the processing and analysis of such data. The various forms of patient data include electronic health records, biomedical images, sensor data, biomedical signals, genomic data, clinical text, biomedical literature, and data gathered from social media.Advanced Data Analytics for Healthcare: These chapters deal with the advanced data analytical methods focused on healthcare. These in..
Data scientist has been called “the sexiest job of the 21st century,” presumably by someone who has never visited a fire station. Nonetheless, data science is a hot and growing field, and it doesn’t take a great deal of sleuthing to find analysts breathlessly prognosticating that over the next 10 years, we’ll need billions and billions more data scientists than we currently have. But what is data science? After all, we can’t produce data scientists if we don’t know what data science is. According to a Venn diagram that is somewhat famous in the industry,..
Sales Prediction for Big Mart Outlets
The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim is to build a predictive model and predict the sales of each product at a particular outlet.
Using this model, BigMart will try to understand the properties of products and outlets which play a key role in increasing sales.
Please note that the data may have missing values as some stores might not report all the data due ..
Description
In the two datasets freMTPLfreq and freMTPLsev, risk features are collected for 413,169 motor third-party liability policies (observed mostly on one year), in addition to claim numbers by policy as well as their corresponding claim amounts. freMTPLfreq contains the risk features and claim counts, whilst freMTPLsev contains claim amounts. Both tables can be linked together via the corresponding policy ID.
Additional information can be found at http://cas.uqam.ca/pub/web/CASdatasets-manual.pdf.
A health indicator is a measure designed to summarize information about a given priority topic in population health or health system performance. Health indicators provide comparable and actionable information across different geographic, organizational, or administrative boundaries and/or can track progress over time.
Context
In the dataset freMTPL2freq risk features and claim numbers were collected for 677,991 motor third-part liability policies (observed on a year).
Content
freMTPL2freq contains 11 columns (+IDpol): • IDpol The policy ID (used to link with the claims dataset). • ClaimNb Number of claims during the exposure period. • Exposure The exposure period. • Area The area code. • VehPower The power of the car (ordered categorical). • VehAge The vehicle age, in years. • DrivAge The driver age, in years (in France, people can drive a car at 18). • BonusMalus ..