To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蟾舌o蠑群蟾舌o莉к蟾舌o蠑群蟾舌o莉кB 1110010110110111100100001110001110000010100011111110010110111100100011000101000111100101101101111001000011100011100000101000111111100100101110111000010001111011111001011011011110010000111000111000001010001111111001011011110010001100010100011110010110110111100100001110001110000010100011111110010010111011100001000111101101000010 e5b790e3828fe5bc8c51e5b790e3828fe4bb847be5b790e3828fe5bc8c51e5b790e3828fe4bb847b42
EUC-JP 蟾舌o蠑群蟾舌o莉к蟾舌o蠑群蟾舌o莉кB 1110101010111001110000001110010110100011111011111110101010111110101101111011001011101010101110011100000011100101101000111110111111101000101111011010011111011100111010101011100111000000111001011010001111101111111010101011111010110111101100101110101010111001110000001110010110100011111011111110100010111101101001111101110001000010 eab9c0e5a3efeabeb7b2eab9c0e5a3efe8bda7dceab9c0e5a3efeabeb7b2eab9c0e5a3efe8bda7dc42
UTF-8 蟾舌o蠑群蟾舌o莉к蟾舌o蠑群蟾舌o莉кB 1110100010011111101111101110100010001000100011001110111110111101100011111110100010100000100100011110011110111110101001001110100010011111101111101110100010001000100011001110111110111101100011111110100010001110100010011101000010111010111010001001111110111110111010001000100010001100111011111011110110001111111010001010000010010001111001111011111010100100111010001001111110111110111010001000100010001100111011111011110110001111111010001000111010001001110100001011101001000010 e89fbee8888cefbd8fe8a091e7bea4e89fbee8888cefbd8fe88e89d0bae89fbee8888cefbd8fe8a091e7bea4e89fbee8888cefbd8fe88e89d0ba42
UHC 蟾舌o?群蟾舌o莉к蟾舌o?群蟾舌o莉кB 111000001110101011100000110111111010001111101111001111111100111111011000111000001110101011100000110111111010001111101111110101111110100110101100110111001110000011101010111000001101111110100011111011110011111111001111110110001110000011101010111000001101111110100011111011111101011111101001101011001101110001000010 e0eae0dfa3ef3fcfd8e0eae0dfa3efd7e9acdce0eae0dfa3ef3fcfd8e0eae0dfa3efd7e9acdc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)