What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???泣→?扱???蟻??繹?????誘??B 0011111100111111001111111000101110000011100000011010100000111111100010001011010100111111001111110011111110001011011000010011111100111111111000111000100000111111001111110011111100111111001111111001011101010101001111110011111101000010 3f3f3f8b8381a83f88b53f3f3f8b613f3fe3883f3f3f3f3f97553f3f42
EUC-JP ???泣→?扱???蟻??繹?????誘??B 0011111100111111001111111011010111100011101000101010101000111111101100001011011100111111001111110011111110110101110000100011111100111111111001011110100000111111001111110011111100111111001111111100110110110110001111110011111101000010 3f3f3fb5e3a2aa3fb0b73f3f3fb5c23f3fe5e83f3f3f3f3fcdb63f3f42
UTF-8 捻꿔끇泣→쨫扱琉껂빳蟻뚳펲繹먮냱履륅쫩誘⑸첊B 11101111101001101010010011101010101111111001010011101011100000011000011111100110101100111010001111100010100001101001001011101100101010001010101111100110100010011011000111101111101001111000110011101010101110111000001011101011101110011011001111101000100111111011101111101011100110101011001111101101100011101011001011100111101110011011100111101011101010001010111011101011100000111011000111101111101001111001111111101011101001011000010111101100101010111010100111101000101010101001100011100010100100011011100011101100101100101000101001000010 efa6a4eabf94eb8187e6b3a3e28692eca8abe689b1efa78ceabb82ebb9b3e89fbbeb9ab3ed8eb2e7b9b9eba8aeeb83b1efa79feba585ecaba9e8aa98e291b8ecb28a42
UHC 捻꿔끇泣→쨫扱琉껂빳蟻뚳펲繹먮냱履륅쫩誘⑸첊B 111001101111011110110010111000111000010110111011111010111110100010100001111001101010010010000101110100001110001011101011101001001000001111100100101110111010010111101011111111001000110011101111101111001000010011100110101110101001000011101011100001101000000111101100101010101000111111101111101001101000001011101011101011111010100111101011101010101001011101000010 e6f7b2e385bbebe8a1e6a485d0e2eba483e4bba5ebfc8cefbc84e6ba90eb8681ecaa8fefa682ebafa9ebaa9742

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
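
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names in `CODECS` are an assumed mapping from the table's charset labels to Python's built-in codecs (for example `cp932` for SJIS-WIN and `cp949` for UHC), and the names `encode_to_bits` and `conversion_table` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Return one (label, binary, hex) row per charset in CODECS."""
    return [(label, *encode_to_bits(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("泣→B"):
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.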