To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????F 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f46
SJIS-WIN 鵝??肄??淫???碎λ?逆???ф?珥?F 111010100100000000111111001111111110001111100101001111110011111110001000111110100011111100111111001111111110000111101010100000111100100100111111100010110111010000111111001111110011111110000100100001100011111111100000111000000011111101000110 ea403f3fe3e53f3f88fa3f3f3fe1ea83c93f8b743f3f3f84863fe0e03f46
EUC-JP 鵝??肄??淫???碎λ?逆???ф?珥?F 111100111010000100111111001111111110011011100111001111110011111110110000111111000011111100111111001111111110001011101100101001101100101100111111101101011101010100111111001111110011111110100111111001100011111111100000111000100011111101000110 f3a13f3fe6e73f3fb0fc3f3f3fe2eca6cb3fb5d53f3f3fa7e63fe0e23f46
UTF-8 鵝숈뮆肄덃끽淫됉뉑뉩碎λ룚逆곷벚流ф룚珥콮F 1110100110110101100111011110110010001000100010001110101110101110100001101110100010000010100001001110101110001101100000111110101110000001101111011110011010110111101010111110101110010000100010011110101110001001100100011110101110001001101010011110011110100010100011101100111010111011111010111010001110011010111010011000000010000110111010101011001110110111111010111011001010011010111011111010011110001010110100011000010011101011101000111001101011100111100011111010010111101100101111011010111001000110 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb9089eb8991eb89a9e7a28ecebbeba39ae98086eab3b7ebb29aefa78ad184eba39ae78fa5ecbdae46
UHC 鵝숈뮆肄덃끽淫됉뉑뉩碎λ룚逆곷벚流ф룚珥콮F 11100100101111011001100111101100100100101001010111101100101111011000100011100110101100111010001111101011111000101000100111001011100001111110011010110100101110011110000111101111101001011110101110001111100101101110011010111101100000011110101110111010101000101110101011111100101011001110011010001111100101101110110010110100101100100100001001000110 e4bd99ec9295ecbd88e6b3a3ebe289cb87e6b4b9e1efa5eb8f96e6bd81ebbaa2eafcace68f96ecb4b24246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)