What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 塢??節??娃?n}塢??節??娃?n{^ 100110101100011100111111001111111001000011011111001111110011111110001000101000010011111101101110011111011001101011000111001111110011111110010000110111110011111100111111100010001010000100111111011011100111101101011110 9ac73f3f90df3f3f88a13f6e7d9ac73f3f90df3f3f88a13f6e7b5e
EUC-JP 塢??節?Ŋ娃?n}塢??節?Ŋ娃?n{^ 11010100110010010011111100111111110000001110000100111111100011111010100110101011101100001010001100111111011011100111110111010100110010010011111100111111110000001110000100111111100011111010100110101011101100001010001100111111011011100111101101011110 d4c93f3fc0e13f8fa9abb0a33f6e7dd4c93f3fc0e13f8fa9abb0a33f6e7b5e
UTF-8 塢곩뻔節겼Ŋ娃쮗n}塢곩뻔節겼Ŋ娃쮗n{^ 111001011010000110100010111010101011001110101001111010111011101110010100111001111010111110000000111010101011001010111100110001011000101011100101101010001000001111101100101011101001011101101110011111011110010110100001101000101110101010110011101010011110101110111011100101001110011110101111100000001110101010110010101111001100010110001010111001011010100010000011111011001010111010010111011011100111101101011110 e5a1a2eab3a9ebbb94e7af80eab2bcc58ae5a883ecae976e7de5a1a2eab3a9ebbb94e7af80eab2bcc58ae5a883ecae976e7b5e
UHC 塢곩뻔節겼Ŋ娃쮗n}塢곩뻔節겼Ŋ娃쮗n{^ 11100111111100011000000111100101101110111011011111101111101111011011000011100101101010001010111111101000110111111010100001101111011011100111110111100111111100011000000111100101101110111011011111101111101111011011000011100101101010001010111111101000110111111010100001101111011011100111101101011110 e7f181e5bbb7efbdb0e5a8afe8dfa86f6e7de7f181e5bbb7efbdb0e5a8afe8dfa86f6e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
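
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which appears to be what the table above does as well.

```python
# Hypothetical helper: encode a string in several charsets and show the
# resulting bytes as a binary bit string and as hexadecimal.
# Codec names are an assumed mapping from the table's labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_report(text):
    """Return a list of (charset, bit string, hex string) tuples for text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_report("n{^"):
        print(label, bits, hexstr)
```

For pure ASCII input such as `n{^`, every charset listed here yields the same bytes (`6e7b5e`), which matches the tail of each row in the table; the rows differ only where non-ASCII characters are involved.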