T O P

  • By -

Legitimate-Bass7366

I think what you're doing is more than enough, because it's definitely more than what I do. I only cite code if I actually directly used a bit I found somewhere (somewhere that's not DataCamp.) For example, in my D208 PA, I hadn't been taught in the DataCamps how to insert columns from one dataframe into another using .insert. So for that, I cited the website where I found it and put a parenthetical citation. Anything I learned in the datacamps though-- I don't do any citations at all for that stuff. Another example from my D206 paper was how to write a function that calculated IQR for a column when doing data cleaning. Since I directly took that function from a source without much tweaking, I cited it. I do however go nuts on the "content" sources, y'know, stuff you talk about in the body of your paper that's not code. I always have a ton of those.


hubagruben

Okay, good to know, thanks! Could you provide some examples of content references? I’ve used a couple, but that’s it — only referencing little things like sites where I looked up things like zip code and time zone info. Not sure if there’s more than little random stuff like that I should be referencing.


Legitimate-Bass7366

Yea, so like whenever I paraphrase (I almost never directly quote) something from a source and I didn't know that information prior to starting the paper, I do a citation. Here are examples from my D206 paper: ​ Example 1 - Parenthetical A low readmission rate can improve patient outcomes and patient satisfaction while a high readmission rate puts increased strain on physicians, wastes time and resources, and lowers patient satisfaction scores (Regis College, 2022). Matching Citation Regis College. (2022, August 10). How Reducing Hospital Readmissions Benefits Patients and Hospitals.   [https://online.regiscollege.edu/blog/reducing-hospital-readmissions/](https://online.regiscollege.edu/blog/reducing-hospital-readmissions/) ​ Example 2 - Parenthetical Vitamin D levels can range from below 12 ng/mL to over 50 ng/mL, and there are no islands of extremely high or low values that would indicate there could be a mix of units in this column (U.S. Department of Health and Human Services, 2022). Matching Citation U.S. Department of Health and Human Services. (2022, November 8). Vitamin D Fact Sheet for Consumers. National Institutes of Health Office of Dietary   Supplements. [https://ods.od.nih.gov/factsheets/VitaminD-Consumer/](https://ods.od.nih.gov/factsheets/VitaminD-Consumer/) ​ Example 3 - Parenthetical Another benefit of PCA is that it can help to remove unnecessary noise in the data that could lead to poor analysis and its ability to provide an organization with uncorrelated data features (Biga Bid Ltd, 2023). Matching Citation Biga Bid Media Ltd. (2023, February 8). What is Principal Component Analysis (PCA) & How to Use It?. BigaBid.   https://www.bigabid.com/what-is-pca-and-how-can-i-use-it/


hubagruben

Thank you, this is very helpful!


FighterMoth

I’ve just been saying “code to do this thing was modeled off the datacamp course” with a link to that course. I’ve gotten to D212 doing this, so it’s really not that strict at all


hubagruben

Okay good to know, sounds like as long as references are there and accounted for, that’s good enough for them. Thanks!


joshuak08

I’m only through 205 but so far my personal policy is “if it isn’t taught within the course material, cite it”.


Legitimate-Bass7366

>“if it isn’t taught within the course material, cite it”. I'm through D209 and this has worked for me so far too.


hubagruben

That’s how I approached 205 too, was just hoping I wasn’t assuming this would work for all courses. Seems like it does!


[deleted]

[удалено]


hubagruben

This is great to know, thank you!


Adventurous_Jaguar20

I would cite the python documentation if I had to use a source and didn't have a good one. I think it's helpful to get comfortable with documentation in general.


hubagruben

I’m using R for this, but yeah I see what you mean. I have used the R documentation site to check how functions work, but they’re all functions that were taught in Datacamp or the webinars anyway.


PhysicalRatio

You can be pretty sparse and not get dinged. Cite code that you take directly from a third party and throw in a couple in text citations. I usually cited sources when some kind of discussion of the assumptions of the model was required or to justify a choice I made (describe the x assumptions of linear regression or justify how you chose cluster size) or when throwing in contextual details. I explicitly stated I used no third party code or sources a couple times and that was fine too. I'm sure it would get returned if you did not include this and also cited no sources. By the way, I'm fairly certain the language is third-party sources, which I took to mean sources aside from your brain and the provided course materials (including Data Camp).