To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????kf????k^}Y????kf????k^}bE 0011111100111111001111110011111101101011011001100011111100111111001111110011111101101011010111100111110101011001001111110011111100111111001111110110101101100110001111110011111100111111001111110110101101011110011111010110001001000101 3f3f3f3f6b663f3f3f3f6b5e7d593f3f3f3f6b663f3f3f3f6b5e7d6245
SJIS-WIN 鞨丞シシkf鞨丞シシk^}Y鞨丞シシkf鞨丞シシk^}bE 11101000111000001000111111100101101111001011110001101011011001101110100011100000100011111110010110111100101111000110101101011110011111010101100111101000111000001000111111100101101111001011110001101011011001101110100011100000100011111110010110111100101111000110101101011110011111010110001001000101 e8e08fe5bcbc6b66e8e08fe5bcbc6b5e7d59e8e08fe5bcbc6b66e8e08fe5bcbc6b5e7d6245
EUC-JP 鞨丞シシkf鞨丞シシk^}Y鞨丞シシkf鞨丞シシk^}bE 111100001110001010111110111001111000111010111100100011101011110001101011011001101111000011100010101111101110011110001110101111001000111010111100011010110101111001111101010110011111000011100010101111101110011110001110101111001000111010111100011010110110011011110000111000101011111011100111100011101011110010001110101111000110101101011110011111010110001001000101 f0e2bee78ebc8ebc6b66f0e2bee78ebc8ebc6b5e7d59f0e2bee78ebc8ebc6b66f0e2bee78ebc8ebc6b5e7d6245
UTF-8 鞨丞シシkf鞨丞シシk^}Y鞨丞シシkf鞨丞シシk^}bE 11101001100111101010100011100100101110001001111011101111101111011011110011101111101111011011110001101011011001101110100110011110101010001110010010111000100111101110111110111101101111001110111110111101101111000110101101011110011111010101100111101001100111101010100011100100101110001001111011101111101111011011110011101111101111011011110001101011011001101110100110011110101010001110010010111000100111101110111110111101101111001110111110111101101111000110101101011110011111010110001001000101 e99ea8e4b89eefbdbcefbdbc6b66e99ea8e4b89eefbdbcefbdbc6b5e7d59e99ea8e4b89eefbdbcefbdbc6b66e99ea8e4b89eefbdbcefbdbc6b5e7d6245
UHC 鞨丞??kf鞨丞??k^}Y鞨丞??kf鞨丞??k^}bE 11001010111010101110001110101010001111110011111101101011011001101100101011101010111000111010101000111111001111110110101101011110011111010101100111001010111010101110001110101010001111110011111101101011011001101100101011101010111000111010101000111111001111110110101101011110011111010110001001000101 caeae3aa3f3f6b66caeae3aa3f3f6b5e7d59caeae3aa3f3f6b66caeae3aa3f3f6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)