To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 若??踰?????[若??踰?????[^ 10001110111000010011111100111111111001101111101000111111001111110011111100111111001111110101101110001110111000010011111100111111111001101111101000111111001111110011111100111111001111110101101101011110 8ee13f3fe6fa3f3f3f3f3f5b8ee13f3fe6fa3f3f3f3f3f5b5e
EUC-JP 若??踰?????[若??踰?????[^ 10111100111000110011111100111111111011001111110000111111001111110011111100111111001111110101101110111100111000110011111100111111111011001111110000111111001111110011111100111111001111110101101101011110 bce33f3fecfc3f3f3f3f3f5bbce33f3fecfc3f3f3f3f3f5b5e
UTF-8 若뽧뀿踰잌킊栒듬쿊[若뽧뀿踰잌킊栒듬쿊[^ 111010001000101110100101111010111011110110100111111010111000000010111111111010001011100010110000111011001001111010001100111011011000001010001010111001101010000010010010111010111001001110101100111011001011111110001010010110111110100010001011101001011110101110111101101001111110101110000000101111111110100010111000101100001110110010011110100011001110110110000010100010101110011010100000100100101110101110010011101011001110110010111111100010100101101101011110 e88ba5ebbda7eb80bfe8b8b0ec9e8ced828ae6a092eb93acecbf8a5be88ba5ebbda7eb80bfe8b8b0ec9e8ced828ae6a092eb93acecbf8a5b5e
UHC 若뽧뀿踰잌킊栒듬쿊[若뽧뀿踰잌킊栒듬쿊[^ 111001011011010010010110111000111000010110110101111010111011001010011111111001011011010010010110111000101110001110110101111010111011001010011111010110111110010110110100100101101110001110000101101101011110101110110010100111111110010110110100100101101110001011100011101101011110101110110010100111110101101101011110 e5b496e385b5ebb29fe5b496e2e3b5ebb29f5be5b496e385b5ebb29fe5b496e2e3b5ebb29f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)