To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 渦????オ???D渦????オ???D^ 10001001010100010011111100111111001111110011111110000011010010010011111100111111001111110100010010001001010100010011111100111111001111110011111110000011010010010011111100111111001111110100010001011110 89513f3f3f3f83493f3f3f4489513f3f3f3f83493f3f3f445e
EUC-JP 渦????オ???D渦????オ???D^ 10110001101100100011111100111111001111110011111110100101101010100011111100111111001111110100010010110001101100100011111100111111001111110011111110100101101010100011111100111111001111110100010001011110 b1b23f3f3f3fa5aa3f3f3f44b1b23f3f3f3fa5aa3f3f3f445e
UTF-8 渦욕콪嶪뤹オ呂묈춼D渦욕콪嶪뤹オ呂묈춼D^ 111001101011100010100110111011001001101010010101111011001011110110101010111001011011011010101010111010111010010010111001111000111000001010101010111011111010011010000000111010111010110010001000111011001011011010111100010001001110011010111000101001101110110010011010100101011110110010111101101010101110010110110110101010101110101110100100101110011110001110000010101010101110111110100110100000001110101110101100100010001110110010110110101111000100010001011110 e6b8a6ec9a95ecbdaae5b6aaeba4b9e382aaefa680ebac88ecb6bc44e6b8a6ec9a95ecbdaae5b6aaeba4b9e382aaefa680ebac88ecb6bc445e
UHC 渦욕콪嶪뤹オ呂묈춼D渦욕콪嶪뤹オ呂묈춼D^ 111010001011111010111111111001011011000110011110111001011111010110001111111001111010101110101010111001011111101110010001111001011010110110011000010001001110100010111110101111111110010110110001100111101110010111110101100011111110011110101011101010101110010111111011100100011110010110101101100110000100010001011110 e8bebfe5b19ee5f58fe7abaae5fb91e5ad9844e8bebfe5b19ee5f58fe7abaae5fb91e5ad98445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)