To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 艶?艶?[艶?艶?[^ 100010011001000000111111100010011001000000111111010110111000100110010000001111111000100110010000001111110101101101011110 89903f89903f5b89903f89903f5b5e
EUC-JP 艶?艶?[艶?艶?[^ 101100011111000000111111101100011111000000111111010110111011000111110000001111111011000111110000001111110101101101011110 b1f03fb1f03f5bb1f03fb1f03f5b5e
UTF-8 艶쵰艶쵰[艶쵰艶쵰[^ 111010001000100110110110111011001011010110110000111010001000100110110110111011001011010110110000010110111110100010001001101101101110110010110101101100001110100010001001101101101110110010110101101100000101101101011110 e889b6ecb5b0e889b6ecb5b05be889b6ecb5b0e889b6ecb5b05b5e
UHC 艶쵰艶쵰[艶쵰艶쵰[^ 11100110111111011010110101001100111001101111110110101101010011000101101111100110111111011010110101001100111001101111110110101101010011000101101101011110 e6fdad4ce6fdad4c5be6fdad4ce6fdad4c5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)