Categories
Misc

NVIDIA DLI Releases Accelerated Data Science Teaching Kit

The NVIDIA Deep Learning Institute (DLI) released the Accelerated Data Science Teaching Kit, co-developed with Professor Polo Chau from Georgia Institute of Technology and Professor Xishuang Dong from Prairie View A&M University.

This week, the NVIDIA Deep Learning Institute (DLI) released the Accelerated Data Science Teaching Kit, co-developed with Professor Polo Chau from Georgia Institute of Technology and Professor Xishuang Dong from Prairie View A&M University. 

The comprehensive teaching materials cover fundamental and advanced topics in data collection and pre-processing, accelerated data science with RAPIDS, scalable and distributed computing, GPU-accelerated machine learning, data visualization and graph analytics, and addresses the growing need of teaching data science skills to students in higher education and research institutions.

This first release includes focused modules covering: 

  • Introduction to Data Science and RAPIDS
  • Data Collection and Pre-processing (ETL)
  • Data Ethics and Bias in Data Sets
  • Data Integration and Analytics
  • Data Visualization
  • Scalable and Distributed Computing with Hadoop, Hive and Spark

More modules are planned for future releases.

The kit also covers culturally-responsive topics such as fairness and data bias, as well as challenges and important figures from underrepresented groups.

Lecture slides and notes, hands-on labs, iPython notebooks, solutions (held in private repo), sample data sets, quiz/exam questions/answers, GPU compute resources via free AWS cloud credits, and free DLI online courses/certificates are all included. Lecture videos are planned for future releases.

The RAPIDS data science framework is a GPU-accelerated collection of libraries for executing end-to-end data science pipelines completely on the GPU. The primary objective behind using RAPIDS is to accelerate individual parts of the typical data science workflow, and thereby accelerating the complete end-to-end workflow in Data Preparation and Machine Learning. 

One of the first Jupyter notebook-based labs has students dive right into RAPIDS using pandas and cuDF. Pandas is a data analysis and manipulation tool built on top of the Python programming language to perform various tasks (e.g.: loading, joining, aggregating, filtering data). cuDF is a RAPIDS-based GPU DataFrame library that helps perform similar functionalities with GPU acceleration. 

Students are first tasked with understanding how to create DataFrame objects in cuDF, assigning values to those objects, and then calling methods and applying user-defined functions on the values. Once students have a grasp on working with cuDF DataFrames, they are tasked with creating one from a Netflix movie dataset from Kaggle. 

Figure 1. Snapshot of Teaching Kit Module 1: Intro to RAPIDS Lab.

From there, students learn how to manipulate and interrogate the data, from dropping missing columns and values, querying, and finding unique values, to sorting, counting and grouping. The students will get a feel for how fast and easy it is by using RAPIDS and GPUs versus traditional methods also covered in the Teaching Kit. As a bonus task in the lab, students are finally asked to use cuDF One-hot encoding to convert the dataset’s movie and TV show titles to vectors of 0s and 1s to improve the accuracy of analyzing the data.

“Data Science unlocks the immense potential of data in solving societal challenges and large-scale complex problems across virtually every domain, from business, technology, science, engineering, healthcare, to government, and many more,” said Professor Polo Chau. “As data continues to grow in volume, velocity and complexity, there is an ever-increasing demand for data science talent and skill sets to help design the best solutions.”

This is the fourth Teaching Kit as part of the existing program of 7,000 qualified educators.

Get started with NVIDIA Teaching Kits >> 

Categories
Misc

Think Aggressively This GFN Thursday with Outriders Demo, 11 Additional Games

Here comes another GFN Thursday, dropping in to co-op with you as we explore the world of Square Enix’s new Outriders game. Before we get into the rest of this week’s new additions, let’s head to Enoch and take a closer look at what makes People Can Fly’s upcoming launch special. Let’s Ride From the Read article >

The post Think Aggressively This GFN Thursday with Outriders Demo, 11 Additional Games appeared first on The Official NVIDIA Blog.

Categories
Misc

Dancing DNA Revealed in High-Res HPC Simulations

Using the highest-resolution images of a single DNA molecule captured to date, researchers in the U.K. discovered that coiled strands of genetic material twist and writhe while crammed in a cell.

Using the highest-resolution images of a single DNA molecule captured to date, researchers in the U.K. discovered that coiled strands of genetic material twist and writhe while crammed in a cell.

This previously unobserved movement was simulated on GPU-based systems including JADE, a University of Oxford supercomputer made up of NVIDIA DGX systems. Published in Nature Communications, this research could improve scientists’ understanding of DNA mechanics and inform the development of genetic therapies and diagnostics. 

“GPUs have massively changed the capabilities of biomolecular simulation,” said Sarah Harris, associate professor at the University of Leeds and a researcher on the project, a collaboration among multiple universities. 

By pairing high-resolution atomic force microscopy with HPC simulations, the team created the first videos of twisted DNA molecules called minicircles — where both ends of the molecule join together in a loop. Prior research suggests these minicircles might be markers of health, ageing, and disease. 

Using the GPU-accelerated AMBER package, the researchers were able to model DNA structure with state-of-the-art force fields based on highly accurate quantum mechanical simulations. The dynamics of the molecular simulations were visualized with VMD, which was also used to confirm the presence of hydrogen bonding interactions in the coiled DNA structure.

Adding a twist to a strand of DNA, the researchers found, made the molecules more dynamic, displaying dance-like moves that might help the DNA bind to other molecules. 

The minicircle simulations “are extremely exciting because they show, with remarkable detail, how wrinkled, bubbled, kinked, denatured, and strangely shaped they are, which we hope to be able to control someday,” said Professor Lynn Zechiedrich from Baylor College of Medicine, who provided the DNA minicircles for the study.  

The combination of microscopy images and simulation enabled the researchers to see not just the DNA’s double-helix structure, but the position of each atom within the loop, which contains between 250 and 340 base pairs. 

Read the full Nature Communications paper here

Categories
Misc

Model within a model?

How can I take two feature vectors as outputs from two inputs, store them, and compare a feature vector from a third input to the first feature vectors within one model? End result is dataset C is 80% similar to dataset A 20% to dataset B, etc.

submitted by /u/BestUCanIsGoodEnough
[visit reddit] [comments]

Categories
Misc

Why my Model has a low MAE and low R2 score at the same time?

https://stackoverflow.com/questions/66363862/why-my-model-has-a-low-mae-and-low-r2-score-at-the-same-time

submitted by /u/notm3llo
[visit reddit] [comments]

Categories
Misc

Help with debugging

Help with debugging
submitted by /u/iWatchBlack
[visit reddit] [comments]
Categories
Misc

New TensorBoard Integration in VS Code

New TensorBoard Integration in VS Code

In the latest update of VS Code last week, they added support for TensorBoard integration in VS Code. Just wanted to share with everyone!

https://preview.redd.it/lg64jrhqhij61.png?width=2556&format=png&auto=webp&s=1afe8944a278356a3eafbc98039424b62069c505

To launch tensorboard, just open the command palette in VS Code and search for the command “Launch TensorBoard”

https://preview.redd.it/g96z1lxxhij61.png?width=1064&format=png&auto=webp&s=cc1f0bc16bd9141796e88b6fea4878141bbe74c2

It looks like VS Code will automatically look for your TensorBoard log files within your directory.

submitted by /u/evilcubed
[visit reddit] [comments]

Categories
Misc

Help with debugging

Help with debugging
submitted by /u/iWatchBlack
[visit reddit] [comments]
Categories
Misc

New TensorBoard Integration in VS Code

New TensorBoard Integration in VS Code

In the latest update of VS Code last week, they added support for TensorBoard integration in VS Code. Just wanted to share with everyone!

https://preview.redd.it/lg64jrhqhij61.png?width=2556&format=png&auto=webp&s=1afe8944a278356a3eafbc98039424b62069c505

To launch tensorboard, just open the command palette in VS Code and search for the command “Launch TensorBoard”

https://preview.redd.it/g96z1lxxhij61.png?width=1064&format=png&auto=webp&s=cc1f0bc16bd9141796e88b6fea4878141bbe74c2

It looks like VS Code will automatically look for your TensorBoard log files within your directory.

submitted by /u/evilcubed
[visit reddit] [comments]

Categories
Misc

Help with debugging

Help with debugging
submitted by /u/iWatchBlack
[visit reddit] [comments]