To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????O??U 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f3f3f55
SJIS-WIN ??????????????????O??U 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f3f3f55
EUC-JP ??????????????????O??U 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f3f3f55
UTF-8 횂혩횂쨘횂혩횄쩐횂혩횂쨘횂혩횂쨍횂혪O횂혪U 1110110110011010100000101110110110011000101010011110110110011010100000101110110010101000100110001110110110011010100000101110110110011000101010011110110110011010100001001110110010101001100100001110110110011010100000101110110110011000101010011110110110011010100000101110110010101000100110001110110110011010100000101110110110011000101010011110110110011010100000101110110010101000100011011110110110011010100000101110110110011000101010100100111111101101100110101000001011101101100110001010101001010101 ed9a82ed98a9ed9a82eca898ed9a82ed98a9ed9a84eca990ed9a82ed98a9ed9a82eca898ed9a82ed98a9ed9a82eca88ded9a82ed98aa4fed9a82ed98aa55
UHC 횂혩횂쨘횂혩횄쩐횂혩횂쨘횂혩횂쨍횂혪O횂혪U 110000111000001011000010100100011100001110000010110000101011101011000011100000101100001010010001110000111000001111000010101111101100001110000010110000101001000111000011100000101100001010111010110000111000001011000010100100011100001110000010110000101011100011000011100000101100001010010010010011111100001110000010110000101001001001010101 c382c291c382c2bac382c291c383c2bec382c291c382c2bac382c291c382c2b8c382c2924fc382c29255

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)