To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???魏?????維??蟻??語⑤?議?┸ 00111111001111110011111111101001101100000011111100111111001111110011111100111111100010001101101100111111001111111000101101100001001111110011111110001100111010101000011101000100001111111000101101100011001111111000010010111101 3f3f3fe9b03f3f3f3f3f88db3f3f8b613f3f8cea87443f8b633f84bd
EUC-JP ???魏?????維??蟻??語??議?┸ 001111110011111100111111111100101011001000111111001111110011111100111111001111111011000011011101001111110011111110110101110000100011111100111111101110001110110000111111001111111011010111000100001111111010100010111111 3f3f3ff2b23f3f3f3f3fb0dd3f3fb5c23f3fb8ec3f3fb5c43fa8bf
UTF-8 捻꿔겚魏뉕데嶺뚮뿢維뽪꼷蟻욎춷語⑤벡議롳┸ 111011111010011010100100111010101011111110010100111010101011001010011010111010011010110110001111111010111000100110010101111010111000110110110000111011111010011010101011111010111001101010101110111010111011111110100010111001111011011010101101111010111011110110101010111010101011110010110111111010001001111110111011111011001001101010001110111011001011011010110111111010001010101010011110111000101001000110100100111010111011001010100001111010001010110110110000111010111010000110110011111000101001010010111000 efa6a4eabf94eab29ae9ad8feb8995eb8db0efa6abeb9aaeebbfa2e7b6adebbdaaeabcb7e89fbbec9a8eecb6b7e8aa9ee291a4ebb2a1e8adb0eba1b3e294b8
UHC 捻꿔겚魏뉕데嶺뚮뿢維뽪꼷蟻욎춷語⑤벡議롳┸ 111001101111011110110010111000111000000110110001111010101110000010000111111010101011010110100101111001111010110110001100111010111001011110100010111010111010101110010110111001101000010010001111111010111111110010011110111011001010110110010011111001011101111010101000111010111011101010100100111011001010000110001110111011111010011010111111 e6f7b2e381b1eae087eab5a5e7ad8ceb97a2ebab96e6848febfc9eecad93e5dea8ebbaa4eca18eefa6bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)