To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 혧짬짧hE혧짬짧hEB 1110110110011000101001111110110010100111101011001110110010100111101001110110100001000101111011011001100010100111111011001010011110101100111011001010011110100111011010000100010101000010 ed98a7eca7aceca7a76845ed98a7eca7aceca7a7684542
SJIS-WIN ??§?§¬?§§hE??§?§¬?§§hEB 001111110011111110000001100110000011111110000001100110001000000111001010001111111000000110011000100000011001100001101000010001010011111100111111100000011001100000111111100000011001100010000001110010100011111110000001100110001000000110011000011010000100010101000010 3f3f81983f819881ca3f8198819868453f3f81983f819881ca3f81988198684542
EUC-JP �짬짧hE�짬짧hEB 100011111010101110111111001111111010000111111000100011111010101111000000101000011111100010100010110011001000111110101011110000001010000111111000101000011111100001101000010001011000111110101011101111110011111110100001111110001000111110101011110000001010000111111000101000101100110010001111101010111100000010100001111110001010000111111000011010000100010101000010 8fabbf3fa1f88fabc0a1f8a2cc8fabc0a1f8a1f868458fabbf3fa1f88fabc0a1f8a2cc8fabc0a1f8a1f8684542
UTF-8 혧짬짧hE혧짬짧hEB 1100001110101101110000101001100011000010101001111100001110101100110000101010011111000010101011001100001110101100110000101010011111000010101001110110100001000101110000111010110111000010100110001100001010100111110000111010110011000010101001111100001010101100110000111010110011000010101001111100001010100111011010000100010101000010 c3adc298c2a7c3acc2a7c2acc3acc2a7c2a76845c3adc298c2a7c3acc2a7c2acc3acc2a7c2a7684542
UHC ??§?§??§§hE??§?§??§§hEB 00111111001111111010000111010111001111111010000111010111001111110011111110100001110101111010000111010111011010000100010100111111001111111010000111010111001111111010000111010111001111110011111110100001110101111010000111010111011010000100010101000010 3f3fa1d73fa1d73f3fa1d7a1d768453f3fa1d73fa1d73f3fa1d7a1d7684542

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
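
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function name `to_bit_strings` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's name for UHC
}


def to_bit_strings(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        result[label] = (binary, data.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexa) in to_bit_strings("hEB").items():
        print(label, binary, hexa)
```

Pure ASCII input such as "hEB" yields the same bytes (68 45 42) in all five charsets, which is why every row in the table ends in 684542.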