Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 僥?ズ???裔??絶??塋?9魚??絶??^ 100110010100011000111111100000110101100100111111001111110011111111100101111000010011111100111111100100001110001000111111001111111001101011001000001111111000001001011000100010111001101100111111001111111001000011100010001111110011111101011110 99463f83593f3f3fe5e13f3f90e23f3f9ac83f82588b9b3f3f90e23f3f5e
EUC-JP 僥?ズ獒??裔??絶??塋?9魚??絶??^ 1101000110100111001111111010010110111010100011111100101110111011001111110011111111101010111000110011111100111111110000001110010000111111001111111101010011001010001111111010001110111001101101011111101100111111001111111100000011100100001111110011111101011110 d1a73fa5ba8fcbbb3f3feae33f3fc0e43f3fd4ca3fa3b9b5fb3f3fc0e43f3f5e
UTF-8 僥볡ズ獒방굢裔싷풌絶욇땽塋븃9魚됭굾絶묕풙^ 11100101100000111010010111101011101100111010000111100011100000101011101011100111100011011001001011101011101100001010100111101010101101011010001011101000101000111001010011101100100010111011011111101101100100101000110011100111101101011011011011101100100110101000011111101011100101011011110111100101101000011000101111101011101110001000001111101111101111001001100111101001101011011001101011101011100100001010110111101010101101011011111011100111101101011011011011101011101011001001010111101101100100101001100101011110 e583a5ebb3a1e382bae78d92ebb0a9eab5a2e8a394ec8bb7ed928ce7b5b6ec9a87eb95bde5a18bebb883efbc99e9ad9aeb90adeab5bee7b5b6ebac95ed92995e
UHC 僥볡ズ獒방굢裔싷풌絶욇땽塋븃9魚됭굾絶묕풙^ 11101000111010011001001111100111101010111011101011101000101000111011100111100110100000101000100111100111111000001001101011101111101111101001000111101111101111101001111011101001100010111001001111100111101010111011101011101000101000111011100111100101111000001000100111101000100000101001101011101111101111101001000111101111101111101001110001011110 e8e993e7abbae8a3b9e68289e7e09aefbe91efbe9ee98b93e7abbae8a3b9e5e089e8829aefbe91efbe9c5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
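
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the helper name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table above does.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


for label, binary, hexadecimal in encode_table("魚^"):
    print(label, binary, hexadecimal)
```

For the input `魚^`, the UTF-8 row comes out as `e9ad9a5e` and the ISO-8859-1 row as `3f5e`, which matches the corresponding byte sequences in the table above.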