Using the boolean checkbox to determine showing the plot and description or not.
Do you want to show the following plot?
The vega-lite plot is displayed.
Load the data in first:
Include a data component and point the source to the csv file we load in.
[data name:"BenderlyZwick" source:"BenderlyZwick.csv" /]
Display the dataset in table:
[Table data:BenderlyZwick /]
Tries to create a simple Vega-lite scatter graph using the dataset above.
Line break symbol: [br /]
1. Where I got it:
Name: BenderlyZwick
Dataset taken from: link
Dataset dictionary and some simple descriptions: link
2. Brief Description of what the data is trying to measure:
The dataset is taken from the link above. From my understanding,
the dataset aims to display the inflation, growth and returns for a given stock. However,
there are no date values in the dataset, rather the date values are recorded in the description.
The Dictionary stated that the values were gathered during the time sereis from 1952 to 1982,
thus constitutes 31 rows. Each row in the table corresponds to one year. I guess the purpose of the dataset is to generally record the return for each given growth, and records the growth at that given point.
3. What I am trying to show in the plot.
I am trying to use a scatter plot to show the relationship between growth and return for these given
entries. The x axis was setted to the return, which is the real annual returns on stocks. The y axis was
setted to the growth, which is the annual growth rate of output, measured by real GNP. With the scatter
plot, we can tell that with a higher growth in GNP, we would tend to have a better return, hence created
a somewhat linear correlation between the two variables.
4. Why I used the vega-lite marks I choose.
I had chosen to use the ‘point’ mark in the graph as I believed that the scatter plot would serve the best
purpose of myself in creating a relation plot between the two variables. I had tried line graphs for the same two variables, and the result was not ideal. Similar with bar graph, it will be very hard to create distinct x values while we have so many different entries for return and growth, and with 0 or very minimal overlap in data values.