To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰??給幼??猷??壓?????儒?? 1110000110011111001111110011111111100000101001110011111100111111100010111000101110010111011000110011111100111111100101110101000100111111001111111001101011011000001111110011111100111111001111110011111110001110111100100011111100111111 e19f3f3fe0a73f3f8b8b97633f3f97513f3f9ad83f3f3f3f3f8ef23f3f
EUC-JP 癲??爰??給幼??猷??壓??瑗??儒?? 11100010101000010011111100111111111000001010100100111111001111111011010111101011110011011100010000111111001111111100110110110010001111110011111111010100110110100011111100111111100011111100110011000000001111110011111110111100111101000011111100111111 e2a13f3fe0a93f3fb5ebcdc43f3fcdb23f3fd4da3f3f8fccc03f3fbcf43f3f
UTF-8 癲욌맓爰덃룚給幼뽳쭔猷몄쭖壓쇰낌瑗긺빊儒섏굛 111001111001100110110010111011001001101010001100111010111010011110010011111001111000100010110000111010111000110110000011111010111010001110011010111001111011010110100110111001011011100110111100111010111011110110110011111011001010110110010100111001111000110010110111111010111010101010000100111011001010110110010110111001011010001110010011111011001000011110110000111010111000001010001100111001111001000110010111111010101011100010111010111010111011100110001010111001011000010010010010111011001000010010001111111010101011010110011011 e799b2ec9a8ceba793e788b0eb8d83eba39ae7b5a6e5b9bcebbdb3ecad94e78cb7ebaa84ecad96e5a393ec87b0eb828ce79197eab8baebb98ae58492ec848feab59b
UHC 癲욌맓爰덃룚給幼뽳쭔猷몄쭖壓쇰낌瑗긺빊儒섏굛 1110111110100110100111101110101110010000101001011110101010111010100010001110011010001111100101101101000011100101111010101110101010010110111011111010011110001100111010111010001110111000111011001010011110001110111001001110001010111100111010111011001110100110111010101011110010110001111001111001010110110000111010101110001110011000111011001000001010000011 efa69eeb90a5eaba88e68f96d0e5eaea96efa78ceba3b8eca78ee4e2bcebb3a6eabcb1e795b0eae398ec8283

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)