Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????????????????B | 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42 |
| SJIS-WIN | 鞨奇ス、顏ー譎牙喆驍奇ス、髫ー褶画恤B | 11101000111000001000101011101111101111011010010011101000111110001011000011100110100110011000100111100101111110101001010111101001100000101000101011101111101111011010010011101001100110101011000011100101111101111000100111100110100111001001010101000010 | e8e08aefbda4e8f8b0e69989e5fa95e9828aefbda4e99ab0e5f789e69c9542 |
| EUC-JP | 鞨奇ス、顏ー譎牙喆驍奇ス、髫ー褶画恤B | 1111000011100010101101001111000110001110101111011000111010100100111100001111101010001110101100001110101111111001101100101110011110001111101101011110100011110001111000101011010011110001100011101011110110001110101001001111000111111010100011101011000011101010111110011011001011101000110101111111010101000010 | f0e2b4f18ebd8ea4f0fa8eb0ebf9b2e78fb5e8f1e2b4f18ebd8ea4f1fa8eb0eaf9b2e8d7f542 |
| UTF-8 | 鞨奇ス、顏ー譎牙喆驍奇ス、髫ー褶画恤B | 11101001100111101010100011100101101001011000011111101111101111011011110111101111101111011010010011101001101000011000111111101111101111011011000011101000101011011000111011100111100010011001100111100101100101101000011011101001101010011000110111100101101001011000011111101111101111011011110111101111101111011010010011101001101010111010101111101111101111011011000011101000101001001011011011100111100101001011101111100110100000011010010001000010 | e99ea8e5a587efbdbdefbda4e9a18fefbdb0e8ad8ee78999e59686e9a98de5a587efbdbdefbda4e9ababefbdb0e8a4b6e794bbe681a442 |
| UHC | 鞨奇????譎牙喆驍奇????褶?恤B | 11001010111010101101000011110100001111110011111100111111001111111111110111010010111001001011001111110100110010101111110110100100110100001111010000111111001111110011111100111111111000111010100000111111111111011101000101000010 | caead0f43f3f3f3ffdd2e4b3f4cafda4d0f43f3f3f3fe3a83ffdd142 |

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
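
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels to Python codec names (in particular `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is one plausible reading of the `3f` runs in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("B").items():
        print(label, binary, hexadecimal)
```

For example, `encode_row("B", "utf-8")` returns `("01000010", "42")`, matching the trailing `42` byte in every row of the table.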