What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????\ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ??????????普?????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111100101011000000100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f95813f3f3f3f3f3f3f3f3f5c
EUC-JP ??????????普?????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111110010011110000100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3fc9e13f3f3f3f3f3f3f3f3f5c
UTF-8 렻렓렺읕렦렻렞렺렱렺普렔렺旼씽렞렺렰렺렰\ 11101011101000001011101111101011101000001001001111101011101000001011101011101100100111011001010111101011101000001010011011101011101000001011101111101011101000001001111011101011101000001011101011101011101000001011000111101011101000001011101011100110100110011010111011101011101000001001010011101011101000001011101011100110100101111011110011101100100101001011110111101011101000001001111011101011101000001011101011101011101000001011000011101011101000001011101011101011101000001011000001011100 eba0bbeba093eba0baec9d95eba0a6eba0bbeba09eeba0baeba0b1eba0bae699aeeba094eba0bae697bcec94bdeba09eeba0baeba0b0eba0baeba0b05c
UHC 렻렓렺읕렦렻렞렺렱렺普렔렺旼씽렞렺렰렺렰\ 1000111011000011100011101010100010001110110000101100000011000100100011101011010110001110110000111000111010101111100011101100001010001110101111101000111011000010110111001100010110001110101010011000111011000010110110101100010010111110110001011000111010101111100011101100001010001110101111011000111011000010100011101011110101011100 8ec38ea88ec2c0c48eb58ec38eaf8ec28ebe8ec2dcc58ea98ec2dac4bec58eaf8ec28ebd8ec28ebd5c

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
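
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Unencodable characters are replaced with "?", which mirrors the 3f bytes in the table.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the
    # target charset cannot represent.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("普\\").items():
        print(label, bits, hex_string)
```

For the single character 普, this sketch yields c9e1 in EUC-JP, 9581 in SJIS-WIN, and e699ae in UTF-8, which agrees with the corresponding bytes in the table rows.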