To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 彫?宰?珥?????彫?宰?珥?????^ 100100101010010000111111100011011100100100111111111000001110000000111111001111110011111100111111001111111001001010100100001111111000110111001001001111111110000011100000001111110011111100111111001111110011111101011110 92a43f8dc93fe0e03f3f3f3f3f92a43f8dc93fe0e03f3f3f3f3f5e
EUC-JP 彫?宰?珥?焌???彫?宰?珥?焌???^ 11000100101001100011111110111010110010110011111111100000111000100011111110001111110010011110100000111111001111110011111111000100101001100011111110111010110010110011111111100000111000100011111110001111110010011110100000111111001111110011111101011110 c4a63fbacb3fe0e23f8fc9e83f3f3fc4a63fbacb3fe0e23f8fc9e83f3f3f5e
UTF-8 彫렣宰렞珥렜焌렦롊렯彫렣宰렞珥렜焌렦롊렭^ 11100101101111011010101111101011101000001010001111100101101011101011000011101011101000001001111011100111100011111010010111101011101000001001110011100111100001001000110011101011101000001010011011101011101000011000101011101011101000001010111111100101101111011010101111101011101000001010001111100101101011101011000011101011101000001001111011100111100011111010010111101011101000001001110011100111100001001000110011101011101000001010011011101011101000011000101011101011101000001010110101011110 e5bdabeba0a3e5aeb0eba09ee78fa5eba09ce7848ceba0a6eba18aeba0afe5bdabeba0a3e5aeb0eba09ee78fa5eba09ce7848ceba0a6eba18aeba0ad5e
UHC 彫렣宰렞珥렜焌렦롊렯彫렣宰렞珥렜焌렦롊렭^ 1111000011000001100011101011010011101110101001011000111010101111111011001011010010001110101011101111000111100000100011101011010110001110110100001000111010111100111100001100000110001110101101001110111010100101100011101010111111101100101101001000111010101110111100011110000010001110101101011000111011010000100011101011101001011110 f0c18eb4eea58eafecb48eaef1e08eb58ed08ebcf0c18eb4eea58eafecb48eaef1e08eb58ed08eba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)