To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻?????淫??語??萸?┐誘??鵝 1110010011101000001111110011111100111111001111110011111110001000111110100011111100111111100011001110101000111111001111111110010011001110001111111000010010100010100101110101010100111111001111111110101001000000 e4e83f3f3f3f3f88fa3f3f8cea3f3fe4ce3f84a297553f3fea40
EUC-JP 蒻?????淫??語??萸?┐誘??鵝 1110100011101010001111110011111100111111001111110011111110110000111111000011111100111111101110001110110000111111001111111110100011010000001111111010100010100100110011011011011000111111001111111111001110100001 e8ea3f3f3f3f3fb0fc3f3fb8ec3f3fe8d03fa8a4cdb63f3ff3a1
UTF-8 蒻몃뿫留⑼쭏淫뗫뀆語ⓥ뫕萸랃┐誘↔틓鵝 111010001001001010111011111010111010101010000011111010111011111110101011111011111010011110001101111000101001000110111100111011001010110110001111111001101011011110101011111010111001011110101011111010111000000010000110111010001010101010011110111000101001001110100101111010111010101110010101111010001001000010111000111010111001111010000011111000101001010010010000111010001010101010011000111000101000011010010100111011011000101110010011111010011011010110011101 e892bbebaa83ebbfabefa78de291bcecad8fe6b7abeb97abeb8086e8aa9ee293a5ebab95e890b8eb9e83e29490e8aa98e28694ed8b93e9b59d
UHC 蒻몃뿫留⑼쭏淫뗫뀆語ⓥ뫕萸랃┐誘↔틓鵝 1110010110110110101110001110101110010111101010111110101110100111101010011110111110100111100010001110101111100010100010111110101110000101100000101110010111011110101010001110001010010001101101111110101110101101100011011110111110100110101001001110101110101111101000011110101010111010100000101110010010111101 e5b6b8eb97abeba7a9efa788ebe28beb8582e5dea8e291b7ebad8defa6a4ebafa1eaba82e4bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)