To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???庄?Р絶??^ 00111111001111110011111110001111101011110011111110000100010100011001000011100010001111110011111101011110 3f3f3f8faf3f845190e23f3f5e
EUC-JP ???庄?Р絶??^ 00111111001111110011111110111110101100010011111110100111101100101100000011100100001111110011111101011110 3f3f3fbeb13fa7b2c0e43f3f5e
UTF-8 遼뀐풇庄들Р絶륃넶^ 111011111010011110000011111010111000000010010000111011011001001010000111111001011011101010000100111010111001001110100100110100001010000011100111101101011011011011101011101001011000001111101011100001001011011001011110 efa783eb8090ed9287e5ba84eb93a4d0a0e7b5b6eba583eb84b65e
UHC 遼뀐풇庄들Р絶륃넶^ 11101001101011001011001011101111101111101000111111101101111001001011010111101001101011001011001011101111101111101000111111101110100001101011001101011110 e9acb2efbe8fede4b5e9acb2efbe8fee86b35e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
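
The conversion in the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the implementation behind this page: the function name `encode_row` is hypothetical, the codec names are Python's (assuming `cp932` for SJIS-Win and `cp949` for UHC), and `errors="replace"` is assumed to mirror the `?` (0x3f) substitution seen for characters a charset cannot represent.

```python
# Python codec names standing in for the charset labels used on this page.
# Assumed mapping: SJIS-Win -> cp932, UHC -> cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row("庄Р絶^", codec)
        print(label, bits, hexed)
```

For the characters 庄, Р, 絶 and ^, the hexadecimal output for SJIS-Win, EUC-JP and UTF-8 should match the corresponding byte groups in the table rows above (for example `8faf` for 庄 in SJIS-Win), while ISO-8859-1 yields `3f` for each of the first three.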