To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳????ぜ???烏?????淞る?? 100010100111100000111111001111110011111100111111100000101011101000111111001111110011111110001001010001110011111100111111001111110011111100111111100111111100001010000010111010010011111100111111 8a783f3f3f3f82ba3f3f3f89473f3f3f3f3f9fc282e93f3f
EUC-JP 岳??堉?ぜ洧??烏??彛??淞る?? 101100111101100100111111001111111000111110110111111111010011111110100100101111001000111111000111101101000011111100111111101100011010100000111111001111111000111110111100111110100011111100111111110111101100010010100100111010110011111100111111 b3d93f3f8fb7fd3fa4bc8fc7b43f3fb1a83f3f8fbcfa3f3fdec4a4eb3f3f
UTF-8 岳묒빘堉붻ぜ洧덉쾻烏겸뫗彛녶슖淞る졁燎 111001011011001010110011111010111010110010010010111010111011100110011000111001011010000010001001111010111011011010111011111000111000000110011100111001101011010010100111111010111000110110001001111011001011111010111011111001111000001110001111111010101011001010111000111010111010101110010111111001011011110110011011111010111000010110110110111011001000101010010110111001101011011110011110111000111000001010001011111011001010000110000001111011111010011110000000 e5b2b3ebac92ebb998e5a089ebb6bbe3819ce6b4a7eb8d89ecbebbe7838feab2b8ebab97e5bd9beb85b6ec8a96e6b79ee3828beca181efa780
UHC 岳묒빘堉붻ぜ洧덉쾻烏겸뫗彛녶슖淞る졁燎 1110010010111111100100011110110010010101101110011110101110111100100101001110100010101010101111001110101011111011100010001110110010110010100100011110100010100001101100001110001010010001101110011110110010101101100001101110010110011010101001011110000111100111101010101110101110100000101100101110100011111011 e4bf91ec95b9ebbc94e8aabceafb88ecb291e8a1b0e291b9ecad86e59aa5e1e7aaeba0b2e8fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)