To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 艶?????獄??}艶?????獄??{^ 10001001100100000011111100111111001111110011111100111111100011011001011000111111001111110111110110001001100100000011111100111111001111110011111100111111100011011001011000111111001111110111101101011110 89903f3f3f3f3f8d963f3f7d89903f3f3f3f3f8d963f3f7b5e
EUC-JP 艶?????獄??}艶?????獄??{^ 10110001111100000011111100111111001111110011111100111111101110011111011000111111001111110111110110110001111100000011111100111111001111110011111100111111101110011111011000111111001111110111101101011110 b1f03f3f3f3f3fb9f63f3f7db1f03f3f3f3f3fb9f63f3f7b5e
UTF-8 艶녕윹力꾣누獄멱윭}艶녕윹力꾣누獄멱윭{^ 111010001000100110110110111010111000010110010101111011001001110010111001111011111010011010001010111010101011111010100011111010111000100010000100111001111000110110000100111010111010100110110001111011001001110010101101011111011110100010001001101101101110101110000101100101011110110010011100101110011110111110100110100010101110101010111110101000111110101110001000100001001110011110001101100001001110101110101001101100011110110010011100101011010111101101011110 e889b6eb8595ec9cb9efa68aeabea3eb8884e78d84eba9b1ec9cad7de889b6eb8595ec9cb9efa68aeabea3eb8884e78d84eba9b1ec9cad7b5e
UHC 艶녕윹力꾣누獄멱윭}艶녕윹力꾣누獄멱윭{^ 111001101111110110110011111001111001111110110011111001101011001110000100111001101011010010101001111010001010101110111000111010001001111110101100011111011110011011111101101100111110011110011111101100111110011010110011100001001110011010110100101010011110100010101011101110001110100010011111101011000111101101011110 e6fdb3e79fb3e6b384e6b4a9e8abb8e89fac7de6fdb3e79fb3e6b384e6b4a9e8abb8e89fac7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)