To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鵝??烏l?預??抑???よ。怡??鵝??^ 11101010010000000011111100111111100010010100011110000010100011000011111110010111011000010011111100111111100101110111110100111111001111110011111110000010111001101000000101000010100111000111110100111111001111111110101001000000001111110011111101011110 ea403f3f8947828c3f97613f3f977d3f3f3f82e681429c7d3f3fea403f3f5e
EUC-JP 鵝??烏l?預??抑???よ。怡??鵝??^ 11110011101000010011111100111111101100011010100010100011111011000011111111001101110000100011111100111111110011011101111000111111001111110011111110100100111010001010000110100011110101111101111000111111001111111111001110100001001111110011111101011110 f3a13f3fb1a8a3ec3fcdc23f3fcdde3f3f3fa4e8a1a3d7de3f3ff3a13f3f5e
UTF-8 鵝얜젶烏l츦預룝퐥抑뷰슭溜よ。怡쒑듂鵝롦녂^ 11101001101101011001110111101100100101101001110011101100101000001011011011100111100000111000111111101111101111011000110011101100101110001010011011101001101000001001000011101011101000111001110111101101100100001010010111100110100010101001000111101011101101111011000011101100100010101010110111101111101001111000101111100011100000101000100011100011100000001000001011100110100000001010000111101100100100101001000111101011100100111000001011101001101101011001110111101011101000011010011011101011100001011000001001011110 e9b59dec969ceca0b6e7838fefbd8cecb8a6e9a090eba39ded90a5e68a91ebb7b0ec8aadefa78be38288e38082e680a1ec9291eb9382e9b59deba1a6eb85825e
UHC 鵝얜젶烏l츦預룝퐥抑뷰슭溜よ。怡쒑듂鵝롦녂^ 11100100101111011011111011101011101000001010101011101000101000011010001111101100101011101001110011100111111010001011011111100100101111011000111011100101111001001011101011100100101111011011111011101010111111101010101011101000101000011010001111101100101011101001110011101000100010101011011111100100101111011000111011100110100001101011101001011110 e4bdbeeba0aae8a1a3ecae9ce7e8b7e4bd8ee5e4bae4bdbeeafeaae8a1a3ecae9ce88ab7e4bd8ee686ba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)