To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥違??喩??孃る?異?┐誘??五 1110010111110001001111111000000101100001100010001110000100111111001111111001101001100111001111110011111110011011011011111000001011101001001111111000100011011001001111111000010010100010100101110101010100111111001111111000110011011100 e5f13f816188e13f3f9a673f3f9b6f82e93f88d93f84a297553f3f8cdc
EUC-JP 褥?‖違??喩??孃る?異?┐誘??五 1110101011110011001111111010000111000010101100001110001100111111001111111101001111001000001111110011111111010101110100001010010011101011001111111011000011011011001111111010100010100100110011011011011000111111001111111011100011011110 eaf33fa1c2b0e33f3fd3c83f3fd5d0a4eb3fb0db3fa8a4cdb63f3fb8de
UTF-8 褥띕∥違얗윀喩쎼렃孃る쪇異뱄┐誘좊쳛五 111010001010010010100101111010111001110110010101111000101000100010100101111010011000000110010101111011001001011010010111111011001001110010000000111001011001011010101001111011001000111010111100111010111010000010000011111001011010110110000011111000111000001010001011111011001010101010000111111001111001010110110000111010111011000110000100111000101001010010010000111010001010101010011000111011001010001010001010111011001011001110011011111001001011101010010100 e8a4a5eb9d95e288a5e98195ec9697ec9c80e596a9ec8ebceba083e5ad83e3828becaa87e795b0ebb184e29490e8aa98eca28aecb39be4ba94
UHC 褥띕∥違얗윀喩쎼렃孃る쪇異뱄┐誘좊쳛五 1110100110110011101101101110101110100001101010111110101011011110101111101110100110011111100010111110101011100111100110111110001110001110100111011110010110111110101010101110101110100101100000011110110010110110101110011110111110100110101001001110101110101111101000001110101110101011100000011110011111101001 e9b3b6eba1abeadebee99f8beae79be38e9de5beaaeba581ecb6b9efa6a4ebafa0ebab81e7e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)