To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蜈??節?<獰↑????鸚??鶯?????^ 1110010110000101001111110011111110010000110111110011111110000001100000111110000011010110100000011010101000111111001111110011111100111111111010100101111100111111001111111110100111110010001111110011111100111111001111110011111101011110 e5853f3f90df3f8183e0d681aa3f3f3f3fea5f3f3fe9f23f3f3f3f3f5e
EUC-JP 蜈??節?<獰↑????鸚??鶯??旿??^ 11101001111001010011111100111111110000001110000100111111101000011110001111100000110110001010001010101100001111110011111100111111001111111111001111000000001111110011111111110010111101000011111100111111100011111100000111110100001111110011111101011110 e9e53f3fc0e13fa1e3e0d8a2ac3f3f3f3ff3c03f3ff2f43f3f8fc1f43f3f5e
UTF-8 蜈졾ㄷ節ㅵ<獰↑쐥嶺묋쪧鸚㏆숴鶯쇘쐴旿딁븳^ 11101000100111001000100011101100101000011011111011100011100001001011011111100111101011111000000011100011100001011011010111101111101111001001110011100111100011011011000011100010100001101001000111101100100100001010010111101111101001101010101111101011101011001000101111101100101010101010011111101001101110001001101011100011100011111000011011101100100010001011010011101001101101101010111111101100100001111001100011101100100100001011010011100110100101111011111111101011100101001000000111101011101110001011001101011110 e89c88eca1bee384b7e7af80e385b5efbc9ce78db0e28691ec90a5efa6abebac8becaaa7e9b89ae38f86ec88b4e9b6afec8798ec90b4e697bfeb9481ebb8b35e
UHC 蜈졾ㄷ節ㅵ<獰↑쐥嶺묋쪧鸚㏆숴鶯쇘쐴旿딁븳^ 11101000101001011010000011100101101001001010011111101111101111011010010011100101101000111011110011100111101111101010000111101000100111001000101011100111101011011001000111101000101001011010000011100101101001001010011111101111101111011010010011100101101000111011110011100111101111101010000111100111111110101000101011100111100101011001110001011110 e8a5a0e5a4a7efbda4e5a3bce7bea1e89c8ae7ad91e8a5a0e5a4a7efbda4e5a3bce7bea1e7fa8ae7959c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)