To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴉?????野?????烏l???????? 1110100111101011001111110011111100111111001111110011111110010110111011000011111100111111001111110011111100111111100010010100011110000010100011000011111100111111001111110011111100111111001111110011111100111111 e9eb3f3f3f3f3f96ec3f3f3f3f3f8947828c3f3f3f3f3f3f3f3f
EUC-JP 鴉?????野?????烏l???????? 1111001011101101001111110011111100111111001111110011111111001100111011100011111100111111001111110011111100111111101100011010100010100011111011000011111100111111001111110011111100111111001111110011111100111111 f2ed3f3f3f3f3fccee3f3f3f3f3fb1a8a3ec3f3f3f3f3f3f3f3f
UTF-8 鴉딃젺溜㏓젻野껊떯溜깅젧烏l츦隸욄틦溜곁륫溜 111010011011010010001001111010111001010010000011111011001010000010111010111011111010011110001011111000111000111110010011111011001010000010111011111010011000011110001110111010101011101110001010111010111001011010101111111011111010011110001011111010101011100110000101111011001010000010100111111001111000001110001111111011111011110110001100111011001011100010100110111011111010011010111000111011001001101010000100111011011000101110100110111011111010011110001011111010101011001110000001111010111010010110101011111011111010011110001011 e9b489eb9483eca0baefa78be38f93eca0bbe9878eeabb8aeb96afefa78beab985eca0a7e7838fefbd8cecb8a6efa6b8ec9a84ed8ba6efa78beab381eba5abefa78b
UHC 鴉딃젺溜㏓젻野껊떯溜깅젧烏l츦隸욄틦溜곁륫溜 1110010010111100100010101110100110100000101011011110101011111110101001111110101110100000101011101110010110101111100000111110101110001011101111111110101011111110101100011110101110100000100111111110100010100001101000111110110010101110100111001110011111100110100111101110011010111010100100001110101011111110101100001110011110111000101000011110101011111110 e4bc8ae9a0adeafea7eba0aee5af83eb8bbfeafeb1eba09fe8a1a3ecae9ce7e69ee6ba90eafeb0e7b8a1eafe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
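
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the function name to_bit_strings and the mapping of charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec.

    Characters that the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for name, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("野", codec)
        print(name, binary, hexadecimal)
```

For example, to_bit_strings("野", "cp932") yields the hexadecimal string 96ec, the same byte pair that appears for 野 in the SJIS-WIN row above.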