86个大数据和区块链的关系,86个大数据和区块链的区别
请查看相关英文文档
① What is the relationship between blockchain and big data?
Blockchain and big data are both hot topics, and the development of big data precedes the development of blockchain. Blockchain has now become a huge industry, and combining the developing blockchain technology with big data will produce different effects. From a technical perspective, big data technology exchanges trust for computing resources, while blockchain technology exchanges computing resources for trust, so the combination of the two has set off a new wave of information security. Based on the characteristics of blockchain distributed data storage, decentralization, non-tampering, traceability, and trustworthiness, Chongqing Jinwowo Network Technology Group has established a strong blockchain research team to professionally provide blockchain as the underlying technology. big data services.
② What is the Internet of Things, what is blockchain, what is big data
The development of blockchain to this day has extended from the initial financial transactions to all transactions that require middlemen for guarantees Or certified application projects, such as house transactions, car sales, etc., can even integrate blockchain technology with other application services through API connection, thereby accelerating the production of various innovative applications, and even helping Accelerate the development of Internet of Things applications.
The earliest application of blockchain is Bitcoin. Blockchain is a series of data blocks generated using cryptographic methods. It is like a database ledger, and the ledger also contains data from Bitcoin. All transaction records since then include multiple block records. Each block corresponds to a part of the transaction and records the hash value of the previous block, forming a chain-like data structure.
Many experts believe that the emergence of blockchain technology solves the missing link in the security, privacy and reliability issues of the Internet of Things. It can be used to track billions of interconnected devices, facilitate transactions and collaborative processing between devices, and save a lot of costs for the IoT industry. This decentralized approach will eliminate single points of failure and create a more reliable of devices running the ecosystem. At the same time, the encryption algorithm used in the blockchain can also bring higher privacy to consumer data.
The advantage of the blockchain is that it is public, and every network participant can see the blocks and the transaction information stored in them. However, this does not mean that everyone can see your actual transaction content, which is protected by your private key.
Blockchain is decentralized. There is therefore no single authority that can approve transactions or set special rules for the receipt of transactions. This means that there is a huge amount of trust between participants, so all network participants must reach a consensus to receive transactions.
The more important point is that the blockchain is very secure. This kind of data can only be continuously expanded, and previous records cannot be changed. And the ledger used by the blockchain is tamper-proof and cannot be manipulated by criminals. This ledger is not located in a specific location and cannot be attacked on middlemen because no single communication thread can be intercepted. .
Blockchain can be applied to the Internet of Things to ensure information security. For example, manufacturers of equipment and instruments can use blockchain technology to trace back the manufacturer, production date, manufacturing batch number and other aspects of the manufacturing process of each component. information to ensure the transparency and plasticity of the entire machine production process, effectively improve the availability of the overall system and components, and thereby ensure the safety of the operation of equipment and instruments.
The unique consensus mechanism of the blockchain connects various devices through a point-to-point method, rather than through a central processor. Each device maintains consensus and does not require central verification, thus ensuring When a node has a problem, it will not affect the overall data security of the network.
Now, as blockchain technology continues to develop and heat up, Shenzhen Beihang Internet of Things Research Institute (www.buaaiot.org) believes that it will also change and integrate into various fields in its own unique way. industry. The birth of blockchain technology has made various algorithms more efficient. Its decentralized principles and distributed computing rules allow everything to be connected and calculated in seconds, and prevent hackers from attacking data. All kinds of assets can be realized in the area. Registration, transactions and data on the block can never be tampered with or reversed. This intangible charm allows various assets to gather on the block.
③ What is the Internet of Things, what is big data, what is the blockchain
1. What is the Internet of Things
In fact, to put it simply, big data assists decision-making by analyzing and mining a full amount of non-sampled data.
Characteristics of big data
Big data refers to massive and high-volume data that are intended to serve the purpose of decision-making and require new data processing models to collect, store, manage and analyze their content. Growth rates and diverse information capital.
Big data has the following essential characteristics:
1. The fundamental purpose is to serve decision-making. Big data can help various organizations and individuals greatly improve their decision-making capabilities and make better decisions. Decision-making and judgment;
2. Large scale, big data usually refers to the amount of data above 100T, which is difficult to rely on traditional computing methods to effectively calculate, but must rely on new computing methods and data mining tools;< /p>
3. High frequency. Big data is data generated by user participation and interaction. It is used to understand user-related data in a timely manner based on the user’s network traces. This kind of data is high-frequency based on days or even hours. data. The frequency of traditional data is very low, and a lot of data is calculated by month or even year;
4. Fast, big data is real-time data and can respond in real time. For example, entering a keyword in the Internet search box can be displayed instantly, while traditional data collection methods are seriously lagging behind;
5. Always online. Online is a prerequisite for big data. From this perspective, big data is always online and can be called at any time. By analyzing user traces on various network terminals, big data can better analyze user behavior, emotions,thoughts, hobbies and needs to make better decisions and analyses.
Three key points of big data
First, the availability of data. At present, the development of big data in China is seriously restricted by the lack of openness of government information. Many data are difficult to obtain, making it difficult to realize real big data mining and analysis. This requires the government to open more data in a timely manner to improve the reliability of the data. Gain degrees.
Secondly, carry out scientific model construction. The scientific nature of the model directly determines the quality of data analysis, which requires a high level of modeling. Of course, a larger amount of data will also help in the reasonable construction of the model.
Third, use experts to refine opinions. Unique, high-quality perspectives based on data mining that provide the basis for decision-making are highly dependent on high-quality data interpretation, which reflects the value of industry experts.
Provided by IoT Media
④ Are big data and blockchain the same thing?
Not the same thing
Big data refers to Technical difficulties caused by excessive data volume are mainly 4V, large data volume, fast data generation, diverse data formats, and low data value
There are many technical problems that need to be solved
Blockchain is an information technology terminology of the field. In essence, it is a shared database, and the data or information stored in it has the characteristics of "unforgeable", "full traces left", "traceable", "open and transparent" and "collectively maintained".
⑤ What is the relationship between blockchain and big data analysis
The relationship between blockchain and big data storage is as follows:
1. Data security: Zone Blockchain allows data to truly flow "with confidence"
With its trustworthiness, security and non-tamperability, blockchain allows more data to be liberated. Let’s use a typical case to illustrate how blockchain promotes the generation of gene sequencing big data. Blockchain sequencing can use private keys to limit access rights, thereby circumventing legal restrictions on individuals' access to genetic data, and utilize distributed computing resources to complete sequencing services at low cost. The security of blockchain makes sequencing an industrialized solution, enabling global-scale sequencing, thus promoting the massive growth of data.
2. Data openness and sharing: Blockchain ensures data privacy
The government holds a large amount of high-density and high-value data, such as medical data, population data, etc. The openness of government data is a general trend and will have an immeasurable impetus to the development of the entire economy and society. However, the main difficulty and challenge in data openness is how to open data while protecting personal privacy. Blockchain-based data desensitization technology can ensure data privacy and provides a solution for data openness under privacy protection. Data desensitization technology mainly uses encryption algorithms such as hashing. For example, the Enigma system based on blockchain technology can calculate data without accessing the original data, which can protect the privacy of data and eliminate information security issues in data sharing.question. For example, company employees can safely open access to their salary information and jointly calculate the average salary within the group. Each participant is informed of his relative position within the group but not of the compensation of other members.
Schematic diagram of data HASH desensitization processing
3. Data storage: Blockchain is an immutable, full-historical, strongly endorsed database storage technology. Blockchain technology is shared by all nodes in the network. Participating in calculations and mutually verifying the authenticity of their information to reach a consensus across the entire network, it can be said that blockchain technology is a specific database technology. So far, our big data is still in a very basic stage. Trusted blockchain data based on the consensus of the entire network is non-tamperable and full of history, which also enables the quality of the data to gain unprecedented strong trust endorsement. It also brings the development of database into a new era.
4. Data analysis: Blockchain ensures data security
Data analysis is the core of realizing data value. When conducting data analysis, how to effectively protect personal privacy and prevent core data leakage has become the primary consideration. For example, with the popularization of fingerprint data analysis applications and genetic data detection and analysis methods, more and more people are worried that once personal health data is leaked, it may lead to serious consequences. Blockchain technology can prevent such situations through multi-signature private keys, encryption technology, and secure multi-party computing technology. When the data is hashed and placed on the blockchain, digital signature technology is used to allow only authorized people to access the data. The private key ensures data privacy and can be shared with authorized research institutions. The data is uniformly stored on the decentralized blockchain, and data analysis can be performed without accessing the original data. This can not only protect the privacy of the data, but also safely provide it to global scientific research institutions and doctors for sharing. The basic health database will bring great convenience to solve sudden and difficult diseases in the future.
5. Data circulation: Blockchain protects data-related rights and interests
For individuals or institutions’ valuable data assets, blockchain can be used to register them, and transaction records are recognized and transparent by the entire network. , traceable, clarifying the source, ownership, use rights and circulation path of big data assets, which is of great value to data asset transactions.
On the one hand, blockchain can eliminate the threat of intermediaries copying data and help establish a trustworthy data asset trading environment. Data Chunxin is a very special commodity, which is essentially different from ordinary commodities. It mainly has characteristics such as unclear ownership and "it is owned after seeing it and copying it". This also determines that the transaction method using traditional commodity intermediaries cannot satisfy the data requirements. sharing, exchanging and trading. Because the intermediary center has the conditions and ability to copy and save all the passing Pazhoulun data, this is extremely unfair to the data producers. This threat cannot be eliminated by relying on promises alone, and the existence of this threat has also become a huge obstacle to the flow of data. Based on decentralized blockchain, it can eliminate the need for intermediary centers to copy data.threats and protect the legitimate rights and interests of data owners.
On the other hand, blockchain provides a traceable path and can effectively solve the problem of data verification. The blockchain jointly participates in the calculation and recording of data through multiple nodes participating in calculations in the network, and mutually verifies the validity of their information. It can not only prevent information counterfeiting, but also provide a traceable path. By stringing together the transaction information of each block, a complete list of transaction details is formed. The ins and outs of each transaction are very clear and transparent. In addition, when people have doubts about the "value" of a certain block, they can easily review historical transaction records to determine whether the value is correct, and identify whether the value has been tampered with or recorded incorrectly.
Everything is guaranteed on the blockchain, and big data will naturally become more active.
The tokens of the crowdfunding projects on the Biying China platform are all developed based on blockchain technology, and relevant information will be recorded on the blockchain.
⑥ Big Data and Blockchain
This story still starts with Bitcoin
Bitcoin, an electronic cash system, is simultaneously disintermediated ( Electronic cash between individuals does not require the intervention of a trusted third-party intermediary) and is decentralized (maintained by an institution) (both parties to the transaction can complete transactions without establishing a trust relationship)
< br /> Hash function: Convert an arbitrarily long string into a fixed-length output (the calculation process cannot be too complicated). As long as the input string changes slightly, the output of the hash function will be completely different.
Blockchain: Divide large things into many blocks for storage. As long as one thing is tampered with and the data below are different, it will be discovered
Uses blockchain (data structure hash function) to ensure that the ledger cannot be tampered with, uses digital signature technology to ensure that only you can use your own account, and uses p2p network and POW consensus mechanism to ensure decentralized operation.
Blockchain is a method that uses block chain data structures to verify and store data, uses distributed node consensus algorithms to generate and update data, and uses cryptography to ensure data transmission and access security. A new distributed infrastructure and computing paradigm.
Three elements:
The essence of blockchain is a distributed ledger, a kind of database. The blockchain uses a hash algorithm to ensure that information cannot be tampered with, uses public keys and private keys to identify identities, and collectively maintains a reliable database in a decentralized and disintermediated manner.
The differences between big data and blockchain are mainly reflected in the following aspects.
(1) Data volume. Blockchain technology is a new application model of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithms. Blockchain processes small amounts of data and has detailedprocessing method. Big data manages massive amounts of data, which requires breadth and quantity, and the processing methods will be rougher.
(2) Structured and unstructured. Blockchain is a block with a rigorously defined structure. The chain composed of pointers is a typical structured data, while big data needs to process more unstructured data.
(3) Independence and integration. In order to ensure security, the blockchain system's information is relatively independent, while the focus of big data is the integrated analysis of information.
(4) Direct and indirect. Blockchain is a distributed ledger, which is essentially a database, while big data refers to in-depth analysis and mining of data, which is a kind of indirect data.
(5)CAP theory. C (Consistency) means consistency, which means that any read operation can always read the result of the previously completed write operation. That is, in a distributed environment, data at multiple points is consistent. A (Availability) refers to availability, which refers to obtaining data quickly and returning operation results within a certain time. P (Tolerance of Network Partition) is partition tolerance, which means that when a network partition occurs (that is, some nodes in the system cannot communicate with other nodes), the separated system can still operate normally. CAP theory tells us that a distributed system cannot meet the three requirements of consistency, availability, and partition tolerance at the same time. It can only meet two of them at the same time. As the saying goes, "You can't have your cake and eat it too." Big data usually chooses to implement AP, and blockchain chooses to implement CP.
(6) Basic network. The underlying infrastructure of big data is usually a computer cluster, while the infrastructure of the blockchain is usually a P2P network.
(7) Source of value. For big data, data is information, and value needs to be extracted from the data. For blockchain, data is an asset and the inheritance of value.
(8) Calculation mode. In the big data scenario, one thing is assigned to multiple people. For example, in the MapRece computing framework, a large task will be decomposed into many subtasks and assigned to many nodes for calculation at the same time. In the blockchain scenario, multiple people are allowed to do one thing repeatedly. For example, many nodes in the P2P network record a transaction at the same time.
⑦ Explanation of the term blockchain
Blockchain is a series of verified blocks, each of which is connected to the previous block, all the way to the creation world block.
1. Blockchain (BlockChain)
Blockchain is a series of verified blocks, each of which is connected to the previous block. to the genesis block. Blockchain is the underlying technology of digital currencies such as BitcoinTechnology is a decentralized distributed shared ledger. Blockchain, artificial intelligence, and big data are known as the three major directions of financial technology. Ma Chajue
In 2009, he built an open source project for the Bitcoin system and officially announced the birth of Bitcoin. But when Bitcoin gradually became popular, Satoshi Nakamoto quietly left and disappeared from the Internet.
⑧ What is the relationship between blockchain and big data?
Blockchain and big data are both hot topics. Many people discuss them together and hope to conflict with more hot
points. Big data has become a huge industry long before the development of blockchain. Although blockchain is in the early stages of the
industry, many technologies and business models are still being explored.
In view of the relative maturity of the big data industry, we use big data as a starting point to study how blockchain technology interfaces with all aspects of the big data
industry.
The big data core industry chain can be roughly divided into the following three parts:
Product part: Including big data software product-related industries such as big data basic software and big data application software.
Data part: including data sources (data collection, data provision), data circulation (data transactions, data sharing
) and other industries directly related to data.
Services: related to big data infrastructure services (data storage, data transmission, data cleaning, data desensitization
, etc.), big data analysis services and big data application services services.
01 Product Section
The combination of software products and blockchain should be based on technology. Big data technology and blockchain technology have similarities: they
are distributed architectures.
But they also have obvious differences: the use of distributed technology in big data technology is computing resources - utilizing the computing resources of multiple
machines and will not be used by a single machine Processing tasks are distributed among multiple computers, each
machine. By processing different tasks, it integrates multiple computing resources to form powerful data processing capabilities.
Using distributed technology in blockchain allows multiple entities to trust each other. Each mainframe participates in the operation of the entire blockchain through the computers it controls
. Each computer runs basically the same task, and the entire blockchain
Through repeated redundant calculations, mutual trust between multiple entities can be achieved.
From a technical perspective, big data technology uses trust to exchange computing resources, while blockchain technology uses computing resources to exchange trust. Due to the differences between the two, it is technically difficult to find a suitable collision point between big data and blockchain.
02 Data Part
In the various formats of the data part, the blockchain can find its place.
The chain is of little significance. However, if multiple entities are involved in data collection and data provision, blockchain can
come into its own.
In order to solve the trust problem between multiple entities, each entity broadcasts its own collected data to all consumers
and the hash of that data Values are stored in the blockchain. Based on the hash value on the blockchain, each subject in the blockchain
can verify whether the complete data it received has been tampered with. Blockchain traceability and non-repudiation
make data provided by multiple parties more credible. At the same time, this method also helps maintain the integrity of big data
.
In the data circulation industry, blockchain can play a greater role. In the big data circulation industry, big data itself
is a digital asset. The transaction of digital assets can be realized through the blockchain. Additionally,
decentralized big data transactions implemented on the blockchain can reduce raw data connections. In a decentralized trading platform, only buyers and sellers
have access to the original data; on a centralized trading platform, the data intermediary serving as the trading center usually has access to the original
data. This increases data leakage and assets. Risk of Loss.
03 Service Part
In big data services, the service capabilities as digital assets are the same as the concepts of data and assets in data transactions
and Can be traded.
Blockchain can play an important role in the trading of such digital assets. At the same time, there are many new technologies that are constantly changing the blockchain, such as homomorphic encryption. Homomorphic encryption is an important technology that combines blockchain and big data services
. Homomorphic encryption can truly transform data service functions into digital assets without requiring the big data service provider to copy the original data in the service. risk.