To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 楮?????鬱???蓑?壹?膺?????垈^ 10011110101110000011111100111111001111110011111100111111100111110101010000111111001111110011111110010110101010100011111110011010111000110011111111100100010111100011111100111111001111110011111100111111100110101011000001011110 9eb83f3f3f3f3f9f543f3f3f96aa3f9ae33fe45e3f3f3f3f3f9ab05e
EUC-JP 楮?????鬱???蓑?壹?膺?????垈^ 11011100101110100011111100111111001111110011111100111111110111011011010100111111001111110011111111001100101011000011111111010100111001010011111111100111101111110011111100111111001111110011111100111111110101001011001001011110 dcba3f3f3f3f3fddb53f3f3fccac3fd4e53fe7bf3f3f3f3f3fd4b25e
UTF-8 楮명렪잽렯렢鬱렩뤉롭蓑옇壹맛膺펿녠쫷톽뭍垈^ 11100110101001011010111011101011101010101000010111101011101000001010101011101100100111101011110111101011101000001010111111101011101000001010001011101001101011001011000111101011101000001010100111101011101001001000100111101011101000011010110111101000100100111001000111101100100110001000011111100101101000111011100111101011101001111001101111101000100001101011101011101101100011101011111111101011100001011010000011101100101010111011011111101101100001101011110111101011101011011000110111100101100111101000100001011110 e6a5aeebaa85eba0aaec9ebdeba0afeba0a2e9acb1eba0a9eba489eba1ade89391ec9887e5a3b9eba79be886baed8ebfeb85a0ecabb7ed86bdebad8de59e885e
UHC 楮명렪잽렯렢鬱렩뤉롭蓑옇壹맛膺펿녠쫷톽뭍垈^ 11101110101111111011100011101101100011101011100011000000111011001000111010111100100011101011001111101010101001101000111010110111100011111011100110110111110100111101111011101110101111111011100011101100111011001011100011000000111010111110110010111100100011101011001111101010101001101000111010110111100011111011100110110111110100111101110001011110 eebfb8ed8eb8c0ec8ebc8eb3eaa68eb78fb9b7d3deeebfb8ececb8c0ebecbc8eb3eaa68eb78fb9b7d3dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)