Human Archive, a startup focused on physical AI and robotics, is building what it aims to be one of the world’s largest datasets for training robots by recording how people perform everyday physical tasks in real-world environments.
Founded by young entrepreneurs Rushil Agarwal, Raj Patel, Samay Maini and Shloke Patel, the company is focused on capturing what it calls “human embodied intelligence” — the movements, actions and decision-making processes people use while carrying out physical work.
To collect this data, Human Archive uses custom-built hardware equipped with 4K cameras, depth sensors, tactile gloves, wrist-mounted cameras and body motion sensors. The recorded information is then processed, anonymized and converted into structured datasets that can be used by robotics companies and AI research organizations.
According to the company, India serves as its primary data collection hub due to its large workforce and diverse industrial landscape. Human Archive says it has partnered with more than 125 organizations across sectors including hospitality, restaurants, quick commerce, construction and manufacturing.
The startup reports that it has already gathered tens of thousands of hours of data and plans to expand its collection efforts further. Its long-term goal is to help robots learn complex physical tasks that can be applied across homes, factories, warehouses and other real-world settings.
Supporters of the initiative believe that robotics and physical AI require extensive real-world behavioral datasets, similar to how large language models were trained using massive volumes of internet-based text. Human Archive argues that recording human interaction with the physical world can help accelerate the development of more capable robotic systems.
At the same time, the project has generated discussions around privacy, consent and workforce implications. Some critics have questioned whether workers fully understand how their recorded data may be used and whether such datasets could eventually contribute to the automation of jobs currently performed by people.
The rise of the Human Archive reflects a broader shift within the AI industry toward embodied intelligence, where the focus is moving beyond text and image generation to enabling machines to understand and operate in physical environments.
As investment in robotics and physical AI continues to grow, access to large-scale real-world datasets is increasingly emerging as a critical asset for companies developing the next generation of intelligent machines.
Also read: Viksit Workforce for a Viksit Bharat
Do Follow: The Mainstream LinkedIn | The Mainstream Facebook | The Mainstream Youtube | The Mainstream Twitter
About us:
The Mainstream is a premier platform delivering the latest updates and informed perspectives across the technology business and cyber landscape. Built on research-driven, thought leadership and original intellectual property, The Mainstream also curates summits & conferences that convene decision makers to explore how technology reshapes industries and leadership. With a growing presence in India and globally across the Middle East, Africa, ASEAN, the USA, the UK and Australia, The Mainstream carries a vision to bring the latest happenings and insights to 8.2 billion people and to place technology at the centre of conversation for leaders navigating the future.


