Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥?????藥??永??屋???????? 10011010100010110011111100111111001111110011111100111111111001010101101000111111001111111000100101101001001111110011111110001001101011100011111100111111001111110011111100111111001111110011111100111111 9a8b3f3f3f3f3fe55a3f3f89693f3f89ae3f3f3f3f3f3f3f3f
EUC-JP 嚥?????藥??永??屋???????? 11010011111010110011111100111111001111110011111100111111111010011011101100111111001111111011000111001010001111110011111110110010101100000011111100111111001111110011111100111111001111110011111100111111 d3eb3f3f3f3f3fe9bb3f3fb1ca3f3fb2b03f3f3f3f3f3f3f3f
UTF-8 嚥잒찎溜긱끇藥믥럫永뀀젚屋먮젿栒삣퇁咽롧돓 111001011001101010100101111011001001111010010010111011001011000010001110111011111010011110001011111010101011100010110001111010111000000110000111111010001001011110100101111010111010111110100101111010111001111110101011111001101011000010111000111010111000000010000000111011001010000010011010111001011011000110001011111010111010100010101110111011001010000010111111111001101010000010010010111011001000001010100011111011011000011110000001111011111010011010011110111010111010000110100111111010111000111110010011 e59aa5ec9e92ecb08eefa78beab8b1eb8187e897a5ebafa5eb9fabe6b0b8eb8080eca09ae5b18beba8aeeca0bfe6a092ec82a3ed8781efa69eeba1a7eb8f93
UHC 嚥잒찎溜긱끇藥믥럫永뀀젚屋먮젿栒삣퇁咽롧돓 111001101011111110011111111010001010100110010000111010101111111010110001111000111000010110111011111001011011011110010010111001111000111010001110111001111011010110110010111010111010000010010110111010001010100110010000111010111010000010110001111000101110001110111011111001011011011110010010111001101110110010001110111001111000100110011111 e6bf9fe8a990eafeb1e385bbe5b792e78e8ee7b5b2eba096e8a990eba0b1e2e3bbe5b792e6ec8ee7899f

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
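
For readers who want to reproduce this kind of table locally, here is a minimal Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_table` are assumptions made for illustration. It also assumes that `errors="replace"` matches the page's behavior of substituting "?" (0x3f) for characters a charset cannot represent.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary bit string, hex bit string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become b"?" (0x3f) instead of raising.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_table("あ"):
        print(f"{label:<10} {bits} {hexed}")
```

Calling `encode_table("あ")` should give `e38182` for UTF-8, `82a0` for SJIS-Win, `a4a2` for EUC-JP, and `3f` for ISO-8859-1, since Latin-1 has no hiragana.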