To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????] 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5d
SJIS-WIN 鵝??肄??淫??域??悠??矣???j?] 1110101001000000001111110011111111100011111001010011111100111111100010001111101000111111001111111000100011100110001111110011111110010111010010010011111100111111111000011110000100111111001111110011111110000010100010100011111101011101 ea403f3fe3e53f3f88fa3f3f88e63f3f97493f3fe1e13f3f3f828a3f5d
EUC-JP 鵝??肄??淫??域??悠??矣???j?] 1111001110100001001111110011111111100110111001110011111100111111101100001111110000111111001111111011000011101000001111110011111111001101101010100011111100111111111000101110001100111111001111110011111110100011111010100011111101011101 f3a13f3fe6e73f3fb0fc3f3fb0e83f3fcdaa3f3fe2e33f3f3fa3ea3f5d
UTF-8 鵝숈뮆肄덃끽淫딇닞域밟뫁悠⑴솾矣곕츐力j가] 11101001101101011001110111101100100010001000100011101011101011101000011011101000100000101000010011101011100011011000001111101011100000011011110111100110101101111010101111101011100101001000011111101011100010111001111011100101100111111001111111101011101100001001111111101011101010111000000111100110100000101010000011100010100100011011010011101100100001101011111011100111100111111010001111101010101100111001010111101100101110001001000011101111101001101000101011101111101111011000101011101010101100001000000001011101 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb9487eb8b9ee59f9febb09febab81e682a0e291b4ec86bee79fa3eab395ecb890efa68aefbd8aeab0805d
UHC 鵝숈뮆肄덃끽淫딇닞域밟뫁悠⑴솾矣곕츐力j가] 11100100101111011001100111101100100100101001010111101100101111011000100011100110101100111010001111101011111000101000101011101101100010001001111011100110101101001011100111100010100100011010010111101010111011011010100111100111100110011011001011101011111110001011000011101011101011101000101111100110101100111010001111101010101100001010000101011101 e4bd99ec9295ecbd88e6b3a3ebe28aed889ee6b4b9e291a5eaeda9e799b2ebf8b0ebae8be6b3a3eab0a15d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)