What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 淼夂処淼夲セ玲 | 11111011010001011001101011100111100011111000100011111011010001011001101011101111101111101001011111100110 | fb459ae78f88fb459aefbe97e6
EUC-JP | 淼夂処淼夲セ玲 | 10001111110001111110011011010100111010011011110111101000100011111100011111100110110101001111000110001110101111101100111011101000 | 8fc7e6d4e9bde88fc7e6d4f18ebecee8
UTF-8 | 淼夂処淼夲セ玲 | 111001101011011110111100111001011010010010000010111001011000011110100110111001101011011110111100111001011010010010110010111011111011110110111110111001111000111010110010 | e6b7bce5a482e587a6e6b7bce5a4b2efbdbee78eb2
UHC | ??????玲 | 0011111100111111001111110011111100111111001111111101011010111100 | 3f3f3f3f3f3fd6bc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
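
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the ISO-8859-1 and UHC rows. Python's codec tables may differ from this page's converter for some rarely used characters.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.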