To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??U??u}??U??u{^ | 001111110011111101010101001111110011111101110101011111010011111100111111010101010011111100111111011101010111101101011110 | 3f3f553f3f757d3f3f553f3f757b5e
SJIS-WIN | ??U??u}??U??u{^ | 001111110011111101010101001111110011111101110101011111010011111100111111010101010011111100111111011101010111101101011110 | 3f3f553f3f757d3f3f553f3f757b5e
EUC-JP | ??U??u}??U??u{^ | 001111110011111101010101001111110011111101110101011111010011111100111111010101010011111100111111011101010111101101011110 | 3f3f553f3f757d3f3f553f3f757b5e
UTF-8 | 횂혩U횂혳u}횂혩U횂혳u{^ | 11101101100110101000001011101101100110001010100101010101111011011001101010000010111011011001100010110011011101010111110111101101100110101000001011101101100110001010100101010101111011011001101010000010111011011001100010110011011101010111101101011110 | ed9a82ed98a955ed9a82ed98b3757ded9a82ed98a955ed9a82ed98b3757b5e
UHC | 횂혩U횂혳u}횂혩U횂혳u{^ | 1100001110000010110000101001000101010101110000111000001011000010100110100111010101111101110000111000001011000010100100010101010111000011100000101100001010011010011101010111101101011110 | c382c29155c382c29a757dc382c29155c382c29a757b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
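
For readers who want to reproduce rows like those in the table, here is a minimal Python sketch. It is not this page's actual implementation, which is not shown. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and so is the choice to replace unencodable characters with "?" (0x3f), which is consistent with the ISO-8859-1, SJIS-WIN and EUC-JP rows. The names CODECS and encode_row are made up for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (character, binary bit string, hexadecimal bit string)."""
    codec = CODECS[charset]
    # Characters the charset cannot represent become "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show the text as it survives in that charset.
    shown = raw.decode(codec)
    # Eight zero-padded bits per encoded byte.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "\ud682U"  # the Hangul syllable U+D682 followed by "U"
    for charset in CODECS:
        _, bits, hexa = encode_row(sample, charset)
        print(charset, bits, hexa)
```

Calling encode_row with the first two characters of a row's input and the same charset should give the start of that row's bit strings, provided the assumed codec matches the one the page uses.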