To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????斤????????寶普??似? 001111110011111100111111001111110011111110001011110100100011111100111111001111110011111100111111001111110011111100111111100110111000111110010101100000010011111100111111100011101001011100111111 3f3f3f3f3f8bd23f3f3f3f3f3f3f3f9b8f95813f3f8e973f
EUC-JP ?????斤????????寶普??似? 001111110011111100111111001111110011111110110110110101000011111100111111001111110011111100111111001111110011111100111111110101011110111111001001111000010011111100111111101110111111011100111111 3f3f3f3f3fb6d43f3f3f3f3f3f3f3fd5efc9e13f3fbbf73f
UTF-8 렻렔렺렚렺斤렞렺렰렺렖렻렓렻寶普렔렺似렜 111010111010000010111011111010111010000010010100111010111010000010111010111010111010000010011010111010111010000010111010111001101001011010100100111010111010000010011110111010111010000010111010111010111010000010110000111010111010000010111010111010111010000010010110111010111010000010111011111010111010000010010011111010111010000010111011111001011010111110110110111001101001100110101110111010111010000010010100111010111010000010111010111001001011110010111100111010111010000010011100 eba0bbeba094eba0baeba09aeba0bae696a4eba09eeba0baeba0b0eba0baeba096eba0bbeba093eba0bbe5afb6e699aeeba094eba0bae4bcbceba09c
UHC 렻렔렺렚렺斤렞렺렰렺렖렻렓렻寶普렔렺似렜 10001110110000111000111010101001100011101100001010001110101011011000111011000010110100001100010110001110101011111000111011000010100011101011110110001110110000101000111010101011100011101100001110001110101010001000111011000011110111001100010011011100110001011000111010101001100011101100001011011110110001001000111010101110 8ec38ea98ec28ead8ec2d0c58eaf8ec28ebd8ec28eab8ec38ea88ec3dcc4dcc58ea98ec2dec48eae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)