To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??淫???♂?????毅??喩 1110101001000000001111110011111111100011111001010011111100111111100010001111101000111111001111110011111110000001100010010011111100111111001111110011111100111111100010110100001000111111001111111001101001100111 ea403f3fe3e53f3f88fa3f3f3f81893f3f3f3f3f8b423f3f9a67
EUC-JP 鵝??肄??淫??璵♂?????毅??喩 11110011101000010011111100111111111001101110011100111111001111111011000011111100001111110011111110001111110011001110011010100001111010010011111100111111001111110011111100111111101101011010001100111111001111111101001111001000 f3a13f3fe6e73f3fb0fc3f3f8fcce6a1e93f3f3f3f3fb5a33f3fd3c8
UTF-8 鵝숈뮆肄덃끽淫뚮뙑璵♂쎈땴麗몃씈毅싨끽喩 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001101011011110101011111010111001101010101110111010111001100110010001111001111001001010110101111000101001100110000010111011001000111010001000111010111001010110110100111011111010011010001000111010111010101010000011111011001001010010001000111001101010111110000101111011001000101110101000111010111000000110111101111001011001011010101001 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb9aaeeb9991e792b5e29982ec8e88eb95b4efa688ebaa83ec9488e6af85ec8ba8eb81bde596a9
UHC 鵝숈뮆肄덃끽淫뚮뙑璵♂쎈땴麗몃씈毅싨끽喩 11100100101111011001100111101100100100101001010111101100101111011000100011100110101100111010001111101011111000101000110011101011100011001001011011100110101001011010000111001110101111011110101110001011100010101110011010110000101110001110101110011101101000001110101111110110100110101110011010110011101000111110101011100111 e4bd99ec9295ecbd88e6b3a3ebe28ceb8c96e6a5a1cebdeb8b8ae6b0b8eb9da0ebf69ae6b3a3eae7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)