To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??幼??押る?悠??飮???l? 1110101001000000001111110011111111100011111001010011111100111111100101110110001100111111001111111000100110011111100000101110100100111111100101110100100100111111001111111001111101011010001111110011111100111111100000101000110000111111 ea403f3fe3e53f3f97633f3f899f82e93f97493f3f9f5a3f3f3f828c3f
EUC-JP 鵝??肄??幼??押る?悠??飮???l? 1111001110100001001111110011111111100110111001110011111100111111110011011100010000111111001111111011001010100001101001001110101100111111110011011010101000111111001111111101110110111011001111110011111100111111101000111110110000111111 f3a13f3fe6e73f3fcdc43f3fb2a1a4eb3fcdaa3f3fddbb3f3f3fa3ec3f
UTF-8 鵝숈뮆肄덃끽幼먯춪押る굞悠밧쮦飮덇땔力l찃 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001011011100110111100111010111010100010101111111011001011011010101010111001101000101010111100111000111000001010001011111010101011010110011110111001101000001010100000111010111011000010100111111011001010111010100110111010011010001110101110111010111000110110000111111010111001010110010100111011111010011010001010111011111011110110001100111011001011000010000011 e9b59dec8888ebae86e88284eb8d83eb81bde5b9bceba8afecb6aae68abce3828beab59ee682a0ebb0a7ecaea6e9a3aeeb8d87eb9594efa68aefbd8cecb083
UHC 鵝숈뮆肄덃끽幼먯춪押る굞悠밧쮦飮덇땔力l찃 111001001011110110011001111011001001001010010101111011001011110110001000111001101011001110100011111010101110101010010000111011001010110110000111111001001110001110101010111010111000001010000110111010101110110110111001111001011010100010000011111010111110011010001000111010101011011010101010111001101011001110100011111011001010100110000111 e4bd99ec9295ecbd88e6b3a3eaea90ecad87e4e3aaeb8286eaedb9e5a883ebe688eab6aae6b3a3eca987

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)