To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??音????????應??藥〓?淫? 1110000011001110001111110011111110001001101110010011111100111111001111110011111100111111001111110011111100111111100111001110010000111111001111111110010101011010100000011010110000111111100010001111101000111111 e0ce3f3f89b93f3f3f3f3f3f3f3f9ce43f3fe55a81ac3f88fa3f
EUC-JP 猥??音?????沅??應??藥〓?淫? 11100000110100000011111100111111101100101011101100111111001111110011111100111111001111111000111111000110111010010011111100111111110110001110011000111111001111111110100110111011101000101010111000111111101100001111110000111111 e0d03f3fb2bb3f3f3f3f3f8fc6e93f3fd8e63f3fe9bba2ae3fb0fc3f
UTF-8 猥롢뀧音곗몷嶺뚮뜆沅싨꼮應쇳돦藥〓낄淫츭 111001111000110010100101111010111010000110100010111010111000000010100111111010011001111110110011111010101011001110010111111010111010101010110111111011111010011010101011111010111001101010101110111010111001110010000110111001101011001010000101111011001000101110101000111010101011110010101110111001101000011110001001111011001000011110110011111010111000111110100110111010001001011110100101111000111000000010010011111010111000001010000100111001101011011110101011111011001011100010101101 e78ca5eba1a2eb80a7e99fb3eab397ebaab7efa6abeb9aaeeb9c86e6b285ec8ba8eabcaee68789ec87b3eb8fa6e897a5e38093eb8284e6b7abecb8ad
UHC 猥롢뀧音곗몷嶺뚮뜆沅싨꼮應쇳돦藥〓낄淫츭 11101000111001011000111011100011100001011001111011101011111001011011000011101100100100011001111111100111101011011000110011101011100011011000100111101010101101101001101011100110100001001000100111101011111010111011110011101101100010011010101011100101101101111010000111101011101100111010010111101011111000101010111101000010 e8e58ee3859eebe5b0ec919fe7ad8ceb8d89eab69ae68489ebebbced89aae5b7a1ebb3a5ebe2af42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)