To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???h?????????h??????^ 001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111101011110 3f3f3f683f3f3f3f3f3f3f3f3f683f3f3f3f3f3f5e
SJIS-WIN 蔭??h蔭?????蔭??h蔭?????^ 10001000111111000011111100111111011010001000100011111100001111110011111100111111001111110011111110001000111111000011111100111111011010001000100011111100001111110011111100111111001111110011111101011110 88fc3f3f6888fc3f3f3f3f3f88fc3f3f6888fc3f3f3f3f3f5e
EUC-JP 蔭??h蔭?????蔭??h蔭?????^ 10110000111111100011111100111111011010001011000011111110001111110011111100111111001111110011111110110000111111100011111100111111011010001011000011111110001111110011111100111111001111110011111101011110 b0fe3f3f68b0fe3f3f3f3f3fb0fe3f3f68b0fe3f3f3f3f3f5e
UTF-8 蔭붿꺑h蔭띾졋溜쎌꺓蔭붿꺑h蔭띾죳溜쇱꽔^ 111010001001010010101101111010111011011010111111111010101011101010010001011010001110100010010100101011011110101110011101101111101110110010100001100010111110111110100111100010111110110010001110100011001110101010111010100100111110100010010100101011011110101110110110101111111110101010111010100100010110100011101000100101001010110111101011100111011011111011101100101000111011001111101111101001111000101111101100100001111011000111101010101111011001010001011110 e894adebb6bfeaba9168e894adeb9dbeeca18befa78bec8e8ceaba93e894adebb6bfeaba9168e894adeb9dbeeca3b3efa78bec87b1eabd945e
UHC 蔭붿꺑h蔭띾졋溜쎌꺓蔭붿꺑h蔭띾죳溜쇱꽔^ 111010111110001110010100111011001000001110110111011010001110101111100011100011011110101110100000101110101110101011111110101111011110110010000011101110011110101111100011100101001110110010000011101101110110100011101011111000111000110111101011101000011000111011101010111111101011110011101100100001001010001101011110 ebe394ec83b768ebe38deba0baeafebdec83b9ebe394ec83b768ebe38deba18eeafebcec84a35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)