To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 瑚?瑚驀?諍?瑚驀 100011001110100000111111100011001110100011101001011111010011111111100110011110010011111110001100111010001110100101111101 8ce83f8ce8e97d3fe6793f8ce8e97d
EUC-JP 瑚?瑚驀?諍?瑚驀 101110001110101000111111101110001110101011110001110111100011111111101011110110100011111110111000111010101111000111011110 b8ea3fb8eaf1de3febda3fb8eaf1de
UTF-8 瑚렋瑚驀득諍렋瑚驀 111001111001000110011010111010111010000010001011111001111001000110011010111010011010100110000000111010111001001110011101111010001010101110001101111010111010000010001011111001111001000110011010111010011010100110000000 e7919aeba08be7919ae9a980eb939de8ab8deba08be7919ae9a980
UHC 瑚렋瑚驀득諍렋瑚驀 111110111101000110001110101000101111101111010001110110001110100110110101111001101110111010110101100011101010001011111011110100011101100011101001 fbd18ea2fbd1d8e9b5e6eeb58ea2fbd1d8e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)