To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄μ????怨?????義??怨??癲??誼 1001011011101111100000111100101000111111001111110011111100111111100010011000010100111111001111110011111100111111001111111000101101100000001111110011111110001001100001010011111100111111111000011001111100111111001111111000101101100010 96ef83ca3f3f3f3f89853f3f3f3f3f8b603f3f89853f3fe19f3f3f8b62
EUC-JP 厄μ????怨?????義??怨??癲??誼 1100110011110001101001101100110000111111001111110011111100111111101100011110010100111111001111110011111100111111001111111011010111000001001111110011111110110001111001010011111100111111111000101010000100111111001111111011010111000011 ccf1a6cc3f3f3f3fb1e53f3f3f3f3fb5c13f3fb1e53f3fe2a13f3fb5c3
UTF-8 厄μ옊璘뚳쭓怨뺤졄劣꾨챷義억쭓怨뺤졑癲됱쥓誼 1110010110001110100001001100111010111100111011001001100010001010111011111010011110101111111010111001101010110011111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110000100111011111010011010011101111010101011111010101000111011001011000110110111111001111011111010101001111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110010001111001111001100110110010111010111001000010110001111011001010010110010011111010001010101010111100 e58e84cebcec988aefa7afeb9ab3ecad93e680a8ebbaa4eca184efa69deabea8ecb1b7e7bea9ec96b5ecad93e680a8ebbaa4eca191e799b2eb90b1eca593e8aabc
UHC 厄μ옊璘뚳쭓怨뺤졄劣꾨챷義억쭓怨뺤졑癲됱쥓誼 1110010011111000101001011110110010011110100100101110110011011110100011001110111110100111100010111110101010110011100101011110110010100000101101011110011011101011100001001110101110101010100001001110101111111001101111101110111110100111100010111110101010110011100101011110110010100000101111101110111110100110100010011110110010100010100010101110101111111110 e4f8a5ec9e92ecde8cefa78beab395eca0b5e6eb84ebaa84ebf9beefa78beab395eca0beefa689eca28aebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)