To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 酊?????蹄??魄酊?????蹄??白^ 111001111100001000111111001111110011111100111111001111111001001011111011001111110011111111101001101011101110011111000010001111110011111100111111001111110011111110010010111110110011111100111111100101001001001001011110 e7c23f3f3f3f3f92fb3f3fe9aee7c23f3f3f3f3f92fb3f3f94925e
EUC-JP 酊?????蹄??魄酊?????蹄??白^ 111011101100010000111111001111110011111100111111001111111100010011111101001111110011111111110010101100001110111011000100001111110011111100111111001111110011111111000100111111010011111100111111110001111111001001011110 eec43f3f3f3f3fc4fd3f3ff2b0eec43f3f3f3f3fc4fd3f3fc7f25e
UTF-8 酊렞渽漏렫렖蹄꿴렗魄酊렞渽漏렫렖蹄꿴렗白^ 11101001100001011000101011101011101000001001111011100110101110001011110111101111101001011000111011101011101000001010101111101011101000001001011011101000101110011000010011101010101111111011010011101011101000001001011111101001101011011000010011101001100001011000101011101011101000001001111011100110101110001011110111101111101001011000111011101011101000001010101111101011101000001001011011101000101110011000010011101010101111111011010011101011101000001001011111100111100110011011110101011110 e9858aeba09ee6b8bdefa58eeba0abeba096e8b984eabfb4eba097e9ad84e9858aeba09ee6b8bdefa58eeba0abeba096e8b984eabfb4eba097e799bd5e
UHC 酊렞渽漏렫렖蹄꿴렗魄酊렞渽漏렫렖蹄꿴렗白^ 1110111111111000100011101010111111101110101010101101001011101000100011101011100110001110101010111111000010110100101100101110100110001110101011001101101111011110111011111111100010001110101011111110111010101010110100101110100010001110101110011000111010101011111100001011010010110010111010011000111010101100110110111101110001011110 eff88eafeeaad2e88eb98eabf0b4b2e98eacdbdeeff88eafeeaad2e88eb98eabf0b4b2e98eacdbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)