To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鈺?К嚴??窈??腋?????蘖??節?B 11111011110001000011111110000100010010111001101010001110001111110011111111100010011101110011111100111111111000111111110000111111001111110011111100111111001111111001111101010000001111110011111110010000110111110011111101000010 fbc43f844b9a8e3f3fe2773f3fe3fc3f3f3f3f3f9f503f3f90df3f42
EUC-JP 鈺?К嚴??窈??腋?????蘖??節?B 1000111111100011110101010011111110100111101011001101001111101110001111110011111111100011110110000011111100111111111001101111111000111111001111110011111100111111001111111101110110110001001111110011111111000000111000010011111101000010 8fe3d53fa7acd3ee3f3fe3d83f3fe6fe3f3f3f3f3fddb13f3fc0e13f42
UTF-8 鈺싩К嚴싪춾窈뚳슈腋잍릹樂띶윜蘖쀨썪節큆B 111010011000100010111010111011001000101110101001110100001001101011100101100110101011010011101100100010111010101011101100101101101011111011100111101010101000100011101011100110101011001111101100100010101000100011101000100001011000101111101100100111101000110111101011101001101011100111101111101001101011111111101011100111011011011011101100100111001001110011101000100110001001011011101100100000001010100011101100100011011010101011100111101011111000000011101101100000011000011001000010 e988baec8ba9d09ae59ab4ec8baaecb6bee7aa88eb9ab3ec8a88e8858bec9e8deba6b9efa6bfeb9db6ec9c9ce89896ec80a8ec8daae7af80ed818642
UHC 鈺싩К嚴싪춾窈뚳슈腋잍릹樂띶윜蘖쀨썪節큆B 1110100010101101100110101110011110101100101011001110010111110001100110101110100010101101100110101110100110100001100011001110111110111101101101001110010011111101100111111110011010010000100101111110100011111001100011011110010110011111100111111110010111101110100101111110100010011011100110111110111110111101101101000101001101000010 e8ad9ae7acace5f19ae8ad9ae9a18cefbdb4e4fd9fe69097e8f98de59f9fe5ee97e89b9befbdb45342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
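
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_table and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table above does as well.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec cannot encode.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("B"):
    print(label, bits, hex_string)
```

For the single ASCII letter "B", every charset in this list yields the same byte, 01000010 (hex 42), which matches the final byte of each row in the table.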