To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 寃???寃?各寃???寃?各^ 100110111000001100111111001111110011111110011011100000110011111110001010011001011001101110000011001111110011111100111111100110111000001100111111100010100110010101011110 9b833f3f3f9b833f8a659b833f3f3f9b833f8a655e
EUC-JP 寃???寃?各寃???寃?各^ 110101011110001100111111001111110011111111010101111000110011111110110011110001101101010111100011001111110011111100111111110101011110001100111111101100111100011001011110 d5e33f3f3fd5e33fb3c6d5e33f3f3fd5e33fb3c65e
UTF-8 寃양렏렚寃얕各寃양렏렚寃얕各^ 11100101101011111000001111101100100101101001000111101011101000001000111111101011101000001001101011100101101011111000001111101100100101101001010111100101100100001000010011100101101011111000001111101100100101101001000111101011101000001000111111101011101000001001101011100101101011111000001111101100100101101001010111100101100100001000010001011110 e5af83ec9691eba08feba09ae5af83ec9695e59084e5af83ec9691eba08feba09ae5af83ec9695e590845e
UHC 寃양렏렚寃얕各寃양렏렚寃얕各^ 1110101010110010101111101110011110001110101001011000111010101101111010101011001010111110111010001100101011000000111010101011001010111110111001111000111010100101100011101010110111101010101100101011111011101000110010101100000001011110 eab2bee78ea58eadeab2bee8cac0eab2bee78ea58eadeab2bee8cac05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)