To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 怨?醍?堯?才?雕?? 10001001100001010011111110010001111001110011111111101010100111110011111110001101110010110011111111101000101110000011111100111111 89853f91e73fea9f3f8dcb3fe8b83f3f
EUC-JP 怨?醍?堯?才?雕?? 10110001111001010011111111000010111010010011111111110100101000010011111110111010110011010011111111110000101110100011111100111111 b1e53fc2e93ff4a13fbacd3ff0ba3f3f
UTF-8 怨렊醍렕堯렭才렱雕곈갬 111001101000000010101000111010111010000010001010111010011000011010001101111010111010000010010101111001011010000010101111111010111010000010101101111001101000100110001101111010111010000010110001111010011001101110010101111010101011001110001000111010101011000010101100 e680a8eba08ae9868deba095e5a0afeba0ade6898deba0b1e99b95eab388eab0ac
UHC 怨렊醍렕堯렭才렱雕곈갬 11101010101100111000111010100001111100001011010110001110101010101110100011101011100011101011101011101110101001101000111010111110111100001110011110110000111010011011000010110111 eab38ea1f0b58eaae8eb8ebaeea68ebef0e7b0e9b0b7
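
A conversion like the one in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the rows above show for the unconvertible characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Only ASCII is printed, so the console encoding does not matter.
    for label, (bits, hex_string) in encode_table("怨").items():
        print(label, bits, hex_string)
```

For the single character 怨, this sketch should give the same leading bytes as the corresponding table rows (for example, e680a8 for UTF-8 and 3f for ISO-8859-1).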

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.