What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?溢e?楡⑥???????矣??壤????? 1110000110011111100000111000101100111111100010001110110010000010100001010011111110011110101111101000011101000101001111110011111100111111001111110011111100111111001111111110000111100001001111110011111110011010110111110011111100111111001111110011111100111111 e19f838b3f88ec82853f9ebe87453f3f3f3f3f3f3fe1e13f3f9adf3f3f3f3f3f
EUC-JP 癲ル?溢e?楡??孼?????矣??壤??彛?? 1110001010100001101001011110101100111111101100001110111010100011111001010011111111011100110000000011111100111111100011111011101011000011001111110011111100111111001111110011111111100010111000110011111100111111110101001110000100111111001111111000111110111100111110100011111100111111 e2a1a5eb3fb0eea3e53fdcc03f3f8fbac33f3f3f3f3fe2e33f3fd4e13f3f8fbcfa3f3f
UTF-8 癲ル슪溢e봅楡⑥땡孼꾨챸留뗥꼧矣뗫쳹壤쎼굩彛쒎쳞 111001111001100110110010111000111000001110101011111011001000101010101010111001101011101010100010111011111011110110000101111010111011010010000101111001101010010110100001111000101001000110100101111010111001010110100001111001011010110110111100111010101011111010101000111011001011000110111000111011111010011110001101111010111001011110100101111010101011110010100111111001111001111110100011111010111001011110101011111011001011001110111001111001011010001110100100111011001000111010111100111010101011010110101001111001011011110110011011111011001001001010001110111011001011001110011110 e799b2e383abec8aaae6baa2efbd85ebb485e6a5a1e291a5eb95a1e5adbceabea8ecb1b8efa78deb97a5eabca7e79fa3eb97abecb3b9e5a3a4ec8ebceab5a9e5bd9bec928eecb39e
UHC 癲ル슪溢e봅楡⑥땡孼꾨챸留뗥꼧矣뗫쳹壤쎼굩彛쒎쳞 111011111010011010101011111010111001101010110011111011001110111010100011111001011011101010111110111010101111100010101000111011001011011010101111111001011110110110000100111010111010101010000101111010111010011110001011111001011000010010000100111010111111100010001011111010111010101110011100111001011011110110011011111000111000001010001111111011001010110110011100111001011010101110000100 efa6abeb9ab3eceea3e5babeeaf8a8ecb6afe5ed84ebaa85eba78be58484ebf88bebab9ce5bd9be3828fecad9ce5ab84

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
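
The binary and hexadecimal columns of the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the names CHARSETS, encode_row and encode_table are hypothetical, and the mapping from the labels used here to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) is an assumption. Python's tables for these codecs may differ from this tool's in a few edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour visible in the ISO-8859-1 row.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Python registers "uhc" as an alias of cp949
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("あ").items():
        print(label, bits, hexed)
```

The sketch leaves out the "Character" column, which shows the encoded bytes as rendered back on the page.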