To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???港????港?[???港????港?[^ 001111110011111100111111100011010110000000111111001111110011111100111111100011010110000000111111010110110011111100111111001111111000110101100000001111110011111100111111001111111000110101100000001111110101101101011110 3f3f3f8d603f3f3f3f8d603f5b3f3f3f8d603f3f3f3f8d603f5b5e
EUC-JP ???港????港?[???港????港?[^ 001111110011111100111111101110011100000100111111001111110011111100111111101110011100000100111111010110110011111100111111001111111011100111000001001111110011111100111111001111111011100111000001001111110101101101011110 3f3f3fb9c13f3f3f3fb9c13f5b3f3f3fb9c13f3f3f3fb9c13f5b5e
UTF-8 吳됰뜂港퀊吳됰뜂港퀊[吳됰뜂港퀊吳됰뜂港퀊[^ 111001011001000010110011111010111001000010110000111010111001110010000010111001101011100010101111111011011000000010001010111001011001000010110011111010111001000010110000111010111001110010000010111001101011100010101111111011011000000010001010010110111110010110010000101100111110101110010000101100001110101110011100100000101110011010111000101011111110110110000000100010101110010110010000101100111110101110010000101100001110101110011100100000101110011010111000101011111110110110000000100010100101101101011110 e590b3eb90b0eb9c82e6b8afed808ae590b3eb90b0eb9c82e6b8afed808a5be590b3eb90b0eb9c82e6b8afed808ae590b3eb90b0eb9c82e6b8afed808a5b5e
UHC 吳됰뜂港퀊吳됰뜂港퀊[吳됰뜂港퀊吳됰뜂港퀊[^ 11100111111011111000100111101011100011011000011011111001111110111011001101111010111001111110111110001001111010111000110110000110111110011111101110110011011110100101101111100111111011111000100111101011100011011000011011111001111110111011001101111010111001111110111110001001111010111000110110000110111110011111101110110011011110100101101101011110 e7ef89eb8d86f9fbb37ae7ef89eb8d86f9fbb37a5be7ef89eb8d86f9fbb37ae7ef89eb8d86f9fbb37a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)