To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 憶?????怨?????義??怨??若??誼 10001001101011110011111100111111001111110011111100111111100010011000010100111111001111110011111100111111001111111000101101100000001111110011111110001001100001010011111100111111100011101110000100111111001111111000101101100010 89af3f3f3f3f3f89853f3f3f3f3f8b603f3f89853f3f8ee13f3f8b62
EUC-JP 憶?????怨?????義??怨??若??誼 10110010101100010011111100111111001111110011111100111111101100011110010100111111001111110011111100111111001111111011010111000001001111110011111110110001111001010011111100111111101111001110001100111111001111111011010111000011 b2b13f3f3f3f3fb1e53f3f3f3f3fb5c13f3fb1e53f3fbce33f3fb5c3
UTF-8 憶귣봺利억쭓怨뺤젍劣꾨챷義억쭓怨뺤졑若뽧꺂誼 111001101000011010110110111010101011011110100011111010111011010010111010111011111010011110011101111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010001101111011111010011010011101111010101011111010101000111011001011000110110111111001111011111010101001111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110010001111010001000101110100101111010111011110110100111111010101011101010000010111010001010101010111100 e686b6eab7a3ebb4baefa79dec96b5ecad93e680a8ebbaa4eca08defa69deabea8ecb1b7e7bea9ec96b5ecad93e680a8ebbaa4eca191e88ba5ebbda7eaba82e8aabc
UHC 憶귣봺利억쭓怨뺤젍劣꾨챷義억쭓怨뺤졑若뽧꺂誼 1110010111100011100000101110101110010100100000011110110010100110101111101110111110100111100010111110101010110011100101011110110010100000100011101110011011101011100001001110101110101010100001001110101111111001101111101110111110100111100010111110101010110011100101011110110010100000101111101110010110110100100101101110001110000011101010111110101111111110 e5e382eb9481eca6beefa78beab395eca08ee6eb84ebaa84ebf9beefa78beab395eca0bee5b496e383abebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)