To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 澳?ズ褥ワ?歪??鈺?澳?ズ褥ワ?歪??鈺?E 1110000001010011001111111000001101011001111001011111000110000011100011110011111110011000011000110011111100111111111110111100010000111111111000000101001100111111100000110101100111100101111100011000001110001111001111111001100001100011001111110011111111111011110001000011111101000101 e0533f8359e5f1838f3f98633f3ffbc43fe0533f8359e5f1838f3f98633f3ffbc43f45
EUC-JP 澳?ズ褥ワ?歪??鈺?澳?ズ褥ワ?歪??鈺?E 11011111101101000011111110100101101110101110101011110011101001011110111100111111110011111100010000111111001111111000111111100011110101010011111111011111101101000011111110100101101110101110101011110011101001011110111100111111110011111100010000111111001111111000111111100011110101010011111101000101 dfb43fa5baeaf3a5ef3fcfc43f3f8fe3d53fdfb43fa5baeaf3a5ef3fcfc43f3f8fe3d53f45
UTF-8 澳뉒ズ褥ワ푶歪뉛풜鈺쁚澳뉒ズ褥ワ푶歪뉛풜鈺쁞E 11100110101111101011001111101011100010011001001011100011100000101011101011101000101001001010010111100011100000111010111111101101100100011011011011100110101011011010101011101011100010011001101111101101100100101001110011101001100010001011101011101100100000011001101011100110101111101011001111101011100010011001001011100011100000101011101011101000101001001010010111100011100000111010111111101101100100011011011011100110101011011010101011101011100010011001101111101101100100101001110011101001100010001011101011101100100000011001111001000101 e6beb3eb8992e382bae8a4a5e383afed91b6e6adaaeb899bed929ce988baec819ae6beb3eb8992e382bae8a4a5e383afed91b6e6adaaeb899bed929ce988baec819e45
UHC 澳뉒ズ褥ワ푶歪뉛풜鈺쁚澳뉒ズ褥ワ푶歪뉛풜鈺쁞E 111001111111111010000111111001111010101110111010111010011011001110101011111011111011111010000100111010001110000010000111111011111011111010011111111010001010110110011000010110011110011111111110100001111110011110101011101110101110100110110011101010111110111110111110100001001110100011100000100001111110111110111110100111111110100010101101100110000110001001000101 e7fe87e7abbae9b3abefbe84e8e087efbe9fe8ad9859e7fe87e7abbae9b3abefbe84e8e087efbe9fe8ad986245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)