To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鸚??逸??企???音??永??猷??松??^ 111010100101111100111111001111111000100011101101001111110011111110001010111010010011111100111111001111111000100110111001001111110011111110001001011010010011111100111111100101110101000100111111001111111000111110111100001111110011111101011110 ea5f3f3f88ed3f3f8ae93f3f3f89b93f3f89693f3f97513f3f8fbc3f3f5e
EUC-JP 鸚??逸??企嫄??音??永??猷??松??^ 1111001111000000001111110011111110110000111011110011111100111111101101001110101110001111101110101010000100111111001111111011001010111011001111110011111110110001110010100011111100111111110011011011001000111111001111111011111010111110001111110011111101011110 f3c03f3fb0ef3f3fb4eb8fbaa13f3fb2bb3f3fb1ca3f3fcdb23f3fbebe3f3f5e
UTF-8 鸚쒖눦逸녑럳企嫄얕뼌音욍돧永띔낑猷룟♤松싳뜡^ 11101001101110001001101011101100100100101001011011101011100010001010011011101001100000001011100011101011100001011001000111101011100111111011001111100100101111001000000111100101101010111000010011101100100101101001010111101011101111001000110011101001100111111011001111101100100110101000110111101011100011111010011111100110101100001011100011101011100111011001010011101011100000101001000111100111100011001011011111101011101000111001111111100010100110011010010011100110100111011011111011101100100010111011001111101011100111001010000101011110 e9b89aec9296eb88a6e980b8eb8591eb9fb3e4bc81e5ab84ec9695ebbc8ce99fb3ec9a8deb8fa7e6b0b8eb9d94eb8291e78cb7eba39fe299a4e69dbeec8bb3eb9ca15e
UHC 鸚쒖눦逸녑럳企嫄얕뼌音욍돧永띔낑猷룟♤松싳뜡^ 111001011010010010011100111011001000011110111101111011001110111110110011111001011000111010010011110100001110101011101010101100011011111011101000100101101001010011101011111001011011111111100011100010011010101111100111101101011011011011101010101100111010100111101011101000111011011111100101101000101011101111100001111001101001101011101100100011011010010001011110 e5a49cec87bdecefb3e58e93d0eaeab1bee89694ebe5bfe389abe7b5b6eab3a9eba3b7e5a2bbe1e69aec8da45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)