To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???港??乙??^ 001111110011111100111111100011010110000000111111001111111000100110110011001111110011111101011110 3f3f3f8d603f3f89b33f3f5e
EUC-JP ???港??乙??^ 001111110011111100111111101110011100000100111111001111111011001010110101001111110011111101011110 3f3f3fb9c13f3fb2b53f3f5e
UTF-8 黎싳궡港꿩콢乙덜씞^ 11101111101001101000100111101100100010111011001111101010101101101010000111100110101110001010111111101010101111111010100111101100101111011010001011100100101110011001100111101011100011011001110011101100100101001001111001011110 efa689ec8bb3eab6a1e6b8afeabfa9ecbda2e4b999eb8d9cec949e5e
UHC 黎싳궡港꿩콢乙덜씞^ 11100110101100011001101011101100100000101011010011111001111110111011001011100110101100011001101011101011111000001011010011111010100111011011001001011110 e6b19aec82b4f9fbb2e6b19aebe0b4fa9db25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)