To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????±?? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101100010011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f3f
SJIS-WIN ?西????德?????上?±?曇 00111111100100001011110000111111001111110011111100111111111110101011101000111111001111110011111100111111001111111000111111100011001111111000000101111101001111111001001111011100 3f90bc3f3f3f3ffaba3f3f3f3f3f8fe33f817d3f93dc
EUC-JP ?西??????????上?±?曇 001111111100000010111110001111110011111100111111001111110011111100111111001111110011111100111111001111111011111011100101001111111010000111011110001111111100011011011110 3fc0be3f3f3f3f3f3f3f3f3f3fbee53fa1de3fc6de
UTF-8 렊西롆쒔롐뤦德쳩쩐춲즼죳上춲±춲曇 1110101110100000100010101110100010100101101111111110101110100001100001101110110010010010100101001110101110100001100100001110101110100100101001101110010110111110101101111110110010110011101010011110110010101001100100001110110010110110101100101110110010100110101111001110110010100011101100111110010010111000100010101110110010110110101100101100001010110001111011001011011010110010111001101001101110000111 eba08ae8a5bfeba186ec9294eba190eba4a6e5beb7ecb3a9eca990ecb6b2eca6bceca3b3e4b88aecb6b2c2b1ecb6b2e69b87
UHC 렊西롆쒔롐뤦德쳩쩐춲즼죳上춲±춲曇 10001110101000011110000010100100100011101100110010111110101011011000111011010110100011111101010011010011111011001010101110001110110000101011111010101101100011101010001110001110101000011000111011011111101111101010110110001110101000011011111010101101100011101101001110111110 8ea1e0a48eccbead8ed68fd4d3ecab8ec2bead8ea38ea18edfbead8ea1bead8ed3be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)