To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雅??????徇??怨??霓??泣hぜ義? 1000100111101011001111110011111100111111001111110011111100111111100111000110110100111111001111111000100110000101001111110011111111101000101111010011111100111111100010111000001110000010100010001000001010111010100010110110000000111111 89eb3f3f3f3f3f3f9c6d3f3f89853f3fe8bd3f3f8b83828882ba8b603f
EUC-JP 雅??????徇??怨??霓??泣hぜ義? 1011001011101101001111110011111100111111001111110011111100111111110101111100111000111111001111111011000111100101001111110011111111110000101111110011111100111111101101011110001110100011111010001010010010111100101101011100000100111111 b2ed3f3f3f3f3f3fd7ce3f3fb1e53f3ff0bf3f3fb5e3a3e8a4bcb5c13f
UTF-8 雅붞살뎾連곌퇎徇쒑퐛怨뺤삏霓띰퐢泣hぜ義괙 111010011001101110000101111010111011011010011110111011001000001010110100111010111000111010111110111011111010011010011010111010101011001110001100111011011000011110001110111001011011111010000111111011001001001010010001111011011001000010011011111001101000000010101000111010111011101010100100111011001000001010001111111010011001110010010011111010111001110110110000111011011001000010100010111001101011001110100011111011111011110110001000111000111000000110011100111001111011111010101001111010101011010010011001 e99b85ebb69eec82b4eb8ebeefa69aeab38ced878ee5be87ec9291ed909be680a8ebbaa4ec828fe99c93eb9db0ed90a2e6b3a3efbd88e3819ce7bea9eab499
UHC 雅붞살뎾連곌퇎徇쒑퐛怨뺤삏霓띰퐢泣hぜ義괙 111001001011101010010100110011101011101111101100100010011001000111100110111001101011000011101010101101111001111111100010110111111001110011101000101111011000010111101010101100111001010111101100100110001001011011100111111001111011011011101111101111011000101111101011111010001010001111101000101010101011110011101011111110011000001001000101 e4ba94cebbec8991e6e6b0eab79fe2df9ce8bd85eab395ec9896e7e7b6efbd8bebe8a3e8aabcebf98245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)