To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 潁??徇????????幽?????午??^ 1001111111110001001111110011111110011100011011010011111100111111001111110011111100111111001111110011111100111111100101110100100000111111001111110011111100111111001111111000110011011111001111110011111101011110 9ff13f3f9c6d3f3f3f3f3f3f3f3f97483f3f3f3f3f8cdf3f3f5e
EUC-JP 潁??徇????????幽?????午??^ 1101111011110011001111110011111111010111110011100011111100111111001111110011111100111111001111110011111100111111110011011010100100111111001111110011111100111111001111111011100011100001001111110011111101011110 def33f3fd7ce3f3f3f3f3f3f3f3fcda93f3f3f3f3fb8e13f3f5e
UTF-8 潁욌뜙徇볥젎琉덅룆若뤹툧幽쒌걪溜뤿쨮午끾쭚^ 11100110101111011000000111101100100110101000110011101011100111001001100111100101101111101000011111101011101100111010010111101100101000001000111011101111101001111000110011101011100011011000010111101011101000111000011011101111101001011011010011101011101001001011100111101101100010001010011111100101101110011011110111101100100100101000110011101010101100011010101011101111101001111000101111101011101001001011111111101100101010001010111011100101100011011000100011101011100000011011111011101100101011011001101001011110 e6bd81ec9a8ceb9c99e5be87ebb3a5eca08eefa78ceb8d85eba386efa5b4eba4b9ed88a7e5b9bdec928ceab1aaefa78beba4bfeca8aee58d88eb81beecad9a5e
UHC 潁욌뜙徇볥젎琉덅룆若뤹툧幽쒌걪溜뤿쨮午끾쭚^ 11100111101110001001111011101011100011011001110011100010110111111001001111101011101000001000111111101011101001001000100011101000100011111000010111100101101011101000111111100111101110001001111011101010111010111001110011100011100000011001001111101010111111101000111111101011101001001000100011100111111011011000010111100110101001111001000001011110 e7b89eeb8d9ce2df93eba08feba488e88f85e5ae8fe7b89eeaeb9ce38193eafe8feba488e7ed85e6a7905e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)