To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竣???原?畯經?????原?畯經冠^ 100011110111011000111111001111110011111110001100101101000011111111111011011011111110001101010011001111110011111100111111001111110011111110001100101101000011111111111011011011111110001101010011100010101010010101011110 8f763f3f3f8cb43ffb6fe3533f3f3f3f3f8cb43ffb6fe3538aa55e
EUC-JP 竣?珽?原?畯經???珽?原?畯經冠^ 101111011101011100111111100011111100101111111110001111111011100010110110001111111000111111001101101110111110010110110100001111110011111100111111100011111100101111111110001111111011100010110110001111111000111111001101101110111110010110110100101101001010011101011110 bdd73f8fcbfe3fb8b63f8fcdbbe5b43f3f3f8fcbfe3fb8b63f8fcdbbe5b4b4a75e
UTF-8 竣렞珽렰原렲畯經렜뀀ㅁ珽렰原렲畯經冠^ 11100111101010111010001111101011101000001001111011100111100011111011110111101011101000001011000011100101100011101001111111101011101000001011001011100111100101011010111111100111101101101001001111101011101000001001110011101011100000001000000011100011100001011000000111100111100011111011110111101011101000001011000011100101100011101001111111101011101000001011001011100111100101011010111111100111101101101001001111100101100001101010000001011110 e7aba3eba09ee78fbdeba0b0e58e9feba0b2e795afe7b693eba09ceb8080e38581e78fbdeba0b0e58e9feba0b2e795afe7b693e586a05e
UHC 竣렞珽렰原렲畯經렜뀀ㅁ珽렰原렲畯經冠^ 11110001111000101000111010101111111011111110101010001110101111011110101010101011100011101011111111110001111000011100110011101000100011101010111010110010111010111010010010110001111011111110101010001110101111011110101010101011100011101011111111110001111000011100110011101000110011101010111001011110 f1e28eafefea8ebdeaab8ebff1e1cce88eaeb2eba4b1efea8ebdeaab8ebff1e1cce8ceae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)