To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 馭???∽?矜爰?馭???∽?矜爰?B 111010010110011000111111001111110011111110000001111001000011111111100001111000001110000010100111001111111110100101100110001111110011111100111111100000011110010000111111111000011110000011100000101001110011111101000010 e9663f3f3f81e43fe1e0e0a73fe9663f3f3f81e43fe1e0e0a73f42
EUC-JP 馭??彛∽?矜爰?馭??彛∽?矜爰?B 11110001110001110011111100111111100011111011110011111010101000101110011000111111111000101110001011100000101010010011111111110001110001110011111100111111100011111011110011111010101000101110011000111111111000101110001011100000101010010011111101000010 f1c73f3f8fbcfaa2e63fe2e2e0a93ff1c73f3f8fbcfaa2e63fe2e2e0a93f42
UTF-8 馭곥룂彛∽㎗矜爰펦馭곥룂彛∽㎗矜爰펦B 11101001101001101010110111101010101100111010010111101011101000111000001011100101101111011001101111100010100010001011110111100011100011101001011111100111100111111001110011100111100010001011000011101101100011101010011011101001101001101010110111101010101100111010010111101011101000111000001011100101101111011001101111100010100010001011110111100011100011101001011111100111100111111001110011100111100010001011000011101101100011101010011001000010 e9a6adeab3a5eba382e5bd9be288bde38e97e79f9ce788b0ed8ea6e9a6adeab3a5eba382e5bd9be288bde38e97e79f9ce788b0ed8ea642
UHC 馭곥룂彛∽㎗矜爰펦馭곥룂彛∽㎗矜爰펦B 11100101110111111000000111100011100011111000001111101100101011011010000111101111101001111010001111010000111010001110101010111010101111000111011011100101110111111000000111100011100011111000001111101100101011011010000111101111101001111010001111010000111010001110101010111010101111000111011001000010 e5df81e38f83ecada1efa7a3d0e8eababc76e5df81e38f83ecada1efa7a3d0e8eababc7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)