To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 韆包スコ遶乗ァォ鬯イB 11101000111001101001010111101111101111011011101011100111101010111000111111100110101001111010101111101001101011001011001001000010 e8e695efbdbae7ab8fe6a7abe9acb242
EUC-JP 韆包スコ遶乗ァォ鬯イB 111100001110100011001010111100011000111010111101100011101011101011101110101011011011111011101000100011101010011110001110101010111111001010101110100011101011001001000010 f0e8caf18ebd8ebaeeadbee88ea78eabf2ae8eb242
UTF-8 韆包スコ遶乗ァォ鬯イB 11101001100111111000011011100101100011001000010111101111101111011011110111101111101111011011101011101001100000011011011011100100101110011001011111101111101111011010011111101111101111011010101111101001101011001010111111101111101111011011001001000010 e99f86e58c85efbdbdefbdbae981b6e4b997efbda7efbdabe9acafefbdb242
UHC 韆包????????B 11110100110001111111100011010000001111110011111100111111001111110011111100111111001111110011111101000010 f4c7f8d03f3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)