What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蟾舌o荵ょキ舌o荳募キ舌o荵ょキ舌n 111001011011011110010000111000111000001010001111111001001011100110000010111001011011011110010000111000111000001010001111111001001011100010010101111001011011011110010000111000111000001010001111111001001011100110000010111001011011011110010000111000111000001010001110 e5b790e3828fe4b982e5b790e3828fe4b895e5b790e3828fe4b982e5b790e3828e
EUC-JP 蟾舌o荵ょキ舌o荳募キ舌o荵ょキ舌n 111010101011100111000000111001011010001111101111111010001011101110100100111001111000111010110111110000001110010110100011111011111110100010111010110010101110011110001110101101111100000011100101101000111110111111101000101110111010010011100111100011101011011111000000111001011010001111101110 eab9c0e5a3efe8bba4e78eb7c0e5a3efe8bacae78eb7c0e5a3efe8bba4e78eb7c0e5a3ee
UTF-8 蟾舌o荵ょキ舌o荳募キ舌o荵ょキ舌n 111010001001111110111110111010001000100010001100111011111011110110001111111010001000110110110101111000111000001010000111111011111011110110110111111010001000100010001100111011111011110110001111111010001000110110110011111001011000101110011111111011111011110110110111111010001000100010001100111011111011110110001111111010001000110110110101111000111000001010000111111011111011110110110111111010001000100010001100111011111011110110001110 e89fbee8888cefbd8fe88db5e38287efbdb7e8888cefbd8fe88db3e58b9fefbdb7e8888cefbd8fe88db5e38287efbdb7e8888cefbd8e
UHC 蟾舌o?ょ?舌o荳募?舌o?ょ?舌n 11100000111010101110000011011111101000111110111100111111101010101110011100111111111000001101111110100011111011111101010011100101110110011011010000111111111000001101111110100011111011110011111110101010111001110011111111100000110111111010001111101110 e0eae0dfa3ef3faae73fe0dfa3efd4e5d9b43fe0dfa3ef3faae73fe0dfa3ee

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
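
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption that may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches what the ISO-8859-1 row shows.

```python
# Assumed Python codec names for the charset labels used in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (character, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    # What the encoded bytes look like when read back in the same charset.
    rendered = data.decode(codec, errors="replace")
    # Eight zero-padded bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return rendered, bits, data.hex()


def convert(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    print("Charset", "Character", "Binary", "Hexadecimal")
    for label, (rendered, bits, hexstr) in convert("あ").items():
        print(label, rendered, bits, hexstr)
```

Calling `convert` with any short string yields one row per charset in the same column order as the table above.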