To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??音??豫?????碎??夭??認?? 111000001100111000111111001111111000100110111001001111110011111110011000101011000011111100111111001111110011111100111111111000011110101000111111001111111001101011101110001111110011111110010100010001100011111100111111 e0ce3f3f89b93f3f98ac3f3f3f3f3fe1ea3f3f9aee3f3f94463f3f
EUC-JP 猥??音??豫?????碎??夭??認?? 111000001101000000111111001111111011001010111011001111110011111111010000101011100011111100111111001111110011111100111111111000101110110000111111001111111101010011110000001111110011111111000111101001110011111100111111 e0d03f3fb2bb3f3fd0ae3f3f3f3f3fe2ec3f3fd4f03f3fc7a73f3f
UTF-8 猥롢뀧音곗몥豫곕객留뚪춳碎띔뎃夭뽮옇認욑쬂 111001111000110010100101111010111010000110100010111010111000000010100111111010011001111110110011111010101011001110010111111010111010101010100101111010001011000110101011111010101011001110010101111010101011000010011101111011111010011110001101111010111001101010101010111011001011011010110011111001111010001010001110111010111001110110010100111010111000111010000011111001011010010010101101111010111011110110101110111011001001100010000111111010001010101010001101111011001001101010010001111011001010110010000010 e78ca5eba1a2eb80a7e99fb3eab397ebaaa5e8b1abeab395eab09defa78deb9aaaecb6b3e7a28eeb9d94eb8e83e5a4adebbdaeec9887e8aa8dec9a91ecac82
UHC 猥롢뀧音곗몥豫곕객留뚪춳碎띔뎃夭뽮옇認욑쬂 111010001110010110001110111000111000010110011110111010111110010110110000111011001001000110010011111001111110001110110000111010111011000010110100111010111010011110001100111010011010110110001111111000011110111110110110111010101011010110101011111010001110110010010110111010101011111110111000111011001110001110011110111011111010011010011001 e8e58ee3859eebe5b0ec9193e7e3b0ebb0b4eba78ce9ad8fe1efb6eab5abe8ec96eabfb8ece39eefa699

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)