What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥??爰??醫?┛歪??循??飮??孃??異 10011010100010110011111100111111111000001010011100111111001111111110011111001110001111111000010010101110100110000110001100111111001111111000111101111010001111110011111110011111010110100011111100111111100110110110111100111111001111111000100011011001 9a8b3f3fe0a73f3fe7ce3f84ae98633f3f8f7a3f3f9f5a3f3f9b6f3f3f88d9
EUC-JP 嚥??爰??醫?┛歪??循??飮??孃??異 11010011111010110011111100111111111000001010100100111111001111111110111011010000001111111010100010110000110011111100010000111111001111111011110111011011001111110011111111011101101110110011111100111111110101011101000000111111001111111011000011011011 d3eb3f3fe0a93f3feed03fa8b0cfc43f3fbddb3f3fddbb3f3fd5d03f3fb0db
UTF-8 嚥싲갇爰귝룚醫꾨┛歪묆룂循⑶샒飮뗮맋孃뉕퇍異 111001011001101010100101111011001000101110110010111010101011000010000111111001111000100010110000111010101011011110011101111010111010001110011010111010011000011010101011111010101011111010101000111000101001010010011011111001101010110110101010111010111010110010000110111010111010001110000010111001011011111010101010111000101001000110110110111011001000001110010010111010011010001110101110111010111001011110101110111010111010011110001011111001011010110110000011111010111000100110010101111011011000011110001101111001111001010110110000 e59aa5ec8bb2eab087e788b0eab79deba39ae986abeabea8e2949be6adaaebac86eba382e5beaae291b6ec8392e9a3aeeb97aeeba78be5ad83eb8995ed878de795b0
UHC 嚥싲갇爰귝룚醫꾨┛歪묆룂循⑶샒飮뗮맋孃뉕퇍異 1110011010111111100110101110101110110000101001001110101010111010100000101110011010001111100101101110110010100010100001001110101110100110101100001110100011100000100100011110001110001111100000111110001011100000101010011110100110011000101111111110101111100110100010111110110110010000101000111110010110111110100001111110101010110111100111101110110010110110 e6bf9aebb0a4eaba82e68f96eca284eba6b0e8e091e38f83e2e0a9e998bfebe68bed90a3e5be87eab79eecb6

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (hexadecimal 3f).
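
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The helper names (`CHARSETS`, `to_bit_strings`, `convert`) are hypothetical, and the mapping of the page's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption that may differ from the page's converter for some characters. Characters a charset cannot represent are replaced with "?" (3f), as in the table.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" turns each unencodable character into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

For the input "A", every charset listed here yields the same single byte: binary 01000001, hexadecimal 41. The rows diverge only for characters outside ASCII.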