To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 長?材憺長?材憺^ 100100101011011100111111100011011101111010011100111010011001001010110111001111111000110111011110100111001110100101011110 92b73f8dde9ce992b73f8dde9ce95e
EUC-JP 長?材憺長?材憺^ 110001001011100100111111101110101110000011011000111010111100010010111001001111111011101011100000110110001110101101011110 c4b93fbae0d8ebc4b93fbae0d8eb5e
UTF-8 長렮材憺長렮材憺^ 11101001100101011011011111101011101000001010111011100110100111011001000011100110100001101011101011101001100101011011011111101011101000001010111011100110100111011001000011100110100001101011101001011110 e995b7eba0aee69d90e686bae995b7eba0aee69d90e686ba5e
UHC 長렮材憺長렮材憺^ 1110110111111110100011101011101111101110101001111101001110111100111011011111111010001110101110111110111010100111110100111011110001011110 edfe8ebbeea7d3bcedfe8ebbeea7d3bc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
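
The rows of the table above can be approximated offline with a short script. The following is a minimal Python sketch, not the converter's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are assumed to be the closest Python equivalents of the charsets listed, and `encode_row` is a hypothetical helper name introduced only for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "長^"  # any short string works here
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

With `errors="replace"`, a character that a charset cannot represent becomes the byte `3f` (`?`), which is consistent with the `3f` bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows above.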