Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦????????瘟??耶ョ?言??塋や? 10001001010100010011111100111111001111110011111100111111001111110011111100111111111000011000100100111111001111111001011011101011100000111000011100111111100011001011111000111111001111111001101011001000100000101110001000111111 89513f3f3f3f3f3f3f3fe1893f3f96eb83873f8cbe3f3f9ac882e23f
EUC-JP 渦????????瘟??耶ョ?言??塋や? 10110001101100100011111100111111001111110011111100111111001111110011111100111111111000011110100100111111001111111100110011101101101001011110011100111111101110001100000000111111001111111101010011001010101001001110010000111111 b1b23f3f3f3f3f3f3f3fe1e93f3fcceda5e73fb8c03f3fd4caa4e43f
UTF-8 渦겻퐜呂묋갬嶪뤷쩀瘟룩큹耶ョ떥言됭큹塋や퍟 111001101011100010100110111010101011001010111011111011011001000010011100111011111010011010000000111010111010110010001011111010101011000010101100111001011011011010101010111010111010010010110111111011001010100110000000111001111001100010011111111010111010001110101001111011011000000110111001111010001000000010110110111000111000001110100111111010111001011010100101111010001010100010000000111010111001000010101101111011011000000110111001111001011010000110001011111000111000001010000100111011011000110110011111 e6b8a6eab2bbed909cefa680ebac8beab0ace5b6aaeba4b7eca980e7989feba3a9ed81b9e880b6e383a7eb96a5e8a880eb90aded81b9e5a18be38284ed8d9f
UHC 渦겻퐜呂묋갬嶪뤷쩀瘟룩큹耶ョ떥言됭큹塋や퍟 111010001011111010110000111001001011110110000110111001011111101110010001111010001011000010110111111001011111010110001111111001011010010010011010111010001011000010110111111010001011010010001000111001011010110110101011111001111000101110111000111001011110101110001001111010001011010010001000111001111010101110101010111001001011101110010110 e8beb0e4bd86e5fb91e8b0b7e5f58fe5a49ae8b0b7e8b488e5adabe78bb8e5eb89e8b488e7abaae4bb96

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
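
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC taken to be Windows code page 949
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("渦").items():
        print(label, bits, hexa)
```

For the single character 渦, this sketch yields 8951 in SJIS-WIN, b1b2 in EUC-JP and e6b8a6 in UTF-8, which agrees with the leading bytes of the corresponding rows in the table.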