What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42
SJIS-WIN | ?禪禪?禪禪B | 0011111111100010010101111110001001010111001111111110001001010111111000100101011101000010 | 3fe257e2573fe257e25742
EUC-JP | ?禪禪?禪禪B | 0011111111100011101110001110001110111000001111111110001110111000111000111011100001000010 | 3fe3b8e3b83fe3b8e3b842
UTF-8 | 볲禪禪볲禪禪B | 11101011101100111011001011100111101001101010101011100111101001101010101011101011101100111011001011100111101001101010101011100111101001101010101001000010 | ebb3b2e7a6aae7a6aaebb3b2e7a6aae7a6aa42
UHC | 볲禪禪볲禪禪B | 10010011111110001110000011001001111000001100100110010011111110001110000011001001111000001100100101000010 | 93f8e0c9e0c993f8e0c9e0c942
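
The rows above can be reproduced approximately with a few lines of Python. This is only a sketch, not the code behind this page: the helper name `encode_to_bits` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
def encode_to_bits(text, charset):
    """Return (binary_string, hex_string) for `text` encoded in `charset`.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CODECS.items():
    binary, hex_string = encode_to_bits("볲禪禪볲禪禪B", codec)
    print(label, binary, hex_string)
```

The binary column is simply the hexadecimal column written out with 8 bits per byte, so the final `B` (0x42) always appears as `01000010`.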

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).