What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 曜?????節ら?誤??円??燿⑨?娃??B 100101110110101000111111001111110011111100111111001111111001000011011111100000101110011100111111100011001110101100111111001111111000100101111110001111110011111111100000101000001000011101001000001111111000100010100001001111110011111101000010 976a3f3f3f3f3f90df82e73f8ceb3f3f897e3f3fe0a087483f88a13f3f42
EUC-JP 曜?????節ら?誤??円??燿??娃??B 1100110111001011001111110011111100111111001111110011111111000000111000011010010011101001001111111011100011101101001111110011111110110001110111110011111100111111111000001010001000111111001111111011000010100011001111110011111101000010 cdcb3f3f3f3f3fc0e1a4e93fb8ed3f3fb1df3f3fe0a23f3fb0a33f3f42
UTF-8 曜경솄勵뺡옩節ら걦誤곲슦円됪쐲燿⑨쉠娃뤺쫨B 11100110100110111001110011101010101100101011110111101100100001101000010011101111101001011011111111101011101110101010000111101100100110001010100111100111101011111000000011100011100000101000100111101010101100011010011011101000101010101010010011101010101100111011001011101100100010101010011011100101100001101000011011101011100100001010101011101100100100001011001011100111100001111011111111100010100100011010100011101100100010011010000011100101101010001000001111101011101001001011101011101100101010111010100001000010 e69b9ceab2bdec8684efa5bfebbaa1ec98a9e7af80e38289eab1a6e8aaa4eab3b2ec8aa6e58686eb90aaec90b2e787bfe291a8ec89a0e5a883eba4baecaba842
UHC 曜경솄勵뺡옩節ら걦誤곲슦円됪쐲燿⑨쉠娃뤺쫨B 11101000111110001011000011100110100110011000100111100101111110101001010111101001100111101010100011101111101111011010101011101001100000011000111111101000101001101000000111101001100110101011000011100101111101111000100111100110100111001001010111101000111111001010100011101111101111011010101011101000110111111000111111101000101001101000000101000010 e8f8b0e69989e5fa95e99ea8efbdaae9818fe8a681e99ab0e5f789e69c95e8fca8efbdaae8df8fe8a68142

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
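
The conversion shown in the table can be approximated offline with a short Python sketch. It is only an illustration, not the code behind this page: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("あB").items():
        print(label, binary, hexadecimal)
```

Because each codec library has its own vendor-specific mappings, the output for rare characters may differ slightly from what this page shows.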