To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??窈??韋??節??巍ル―悠 001111110011111100111111100010011011001100111111001111111110001001110111001111110011111111101000111010000011111100111111100100001101111100111111001111111001101111011001100000111000101110000001010111001001011101001001 3f3f3f89b33f3fe2773f3fe8e83f3f90df3f3f9bd9838b815c9749
EUC-JP ???乙??窈??韋??節??巍ル―悠 001111110011111100111111101100101011010100111111001111111110001111011000001111110011111111110000111010100011111100111111110000001110000100111111001111111101011011011011101001011110101110100001101111011100110110101010 3f3f3fb2b53f3fe3d83f3ff0ea3f3fc0e13f3fd6dba5eba1bdcdaa
UTF-8 捻뀁뫑乙녹뿥窈띾맧韋썽씣節륁춾巍ル―悠 111011111010011010100100111010111000000010000001111010111010101110010001111001001011100110011001111010111000010110111001111010111011111110100101111001111010101010001000111010111001110110111110111010111010011110100111111010011001111110001011111011001000110110111101111011001001010010100011111001111010111110000000111010111010010110000001111011001011011010111110111001011011011110001101111000111000001110101011111000101000000010010101111001101000001010100000 efa6a4eb8081ebab91e4b999eb85b9ebbfa5e7aa88eb9dbeeba7a7e99f8bec8dbdec94a3e7af80eba581ecb6bee5b78de383abe28095e682a0
UHC 捻뀁뫑乙녹뿥窈띾맧韋썽씣節륁춾巍ル―悠 1110011011110111101100101110110010010001101100111110101111100000101100111110110010010111101001011110100110100001100011011110101110010000101100001110101011011111101111011110100110011101101101111110111110111101100011111110110010101101100110101110100011100100101010111110101110100001101010101110101011101101 e6f7b2ec91b3ebe0b3ec97a5e9a18deb90b0eadfbde99db7efbd8fecad9ae8e4abeba1aaeaed

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
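
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("乙").items():
        print(name, bits, hexstr)
```

For the single character 乙, this gives 89b3 in SJIS-WIN, b2b5 in EUC-JP, and e4b999 in UTF-8, which agrees with the corresponding bytes in the table. ISO-8859-1 cannot represent it and yields 3f.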