To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????V 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f56
SJIS-WIN ????????????????????????V 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f56
EUC-JP ????????????????????????V 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f56
UTF-8 센셈센셋센셈센셥렯렱렯렳렯렱렯롍렯렱렯렳렯렱렯렧V 11101100100001001011110011101100100001011000100011101100100001001011110011101100100001011000101111101100100001001011110011101100100001011000100011101100100001001011110011101100100001011010010111101011101000001010111111101011101000001011000111101011101000001010111111101011101000001011001111101011101000001010111111101011101000001011000111101011101000001010111111101011101000011000110111101011101000001010111111101011101000001011000111101011101000001010111111101011101000001011001111101011101000001010111111101011101000001011000111101011101000001010111111101011101000001010011101010110 ec84bcec8588ec84bcec858bec84bcec8588ec84bcec85a5eba0afeba0b1eba0afeba0b3eba0afeba0b1eba0afeba18deba0afeba0b1eba0afeba0b3eba0afeba0b1eba0afeba0a756
UHC 센셈센셋센셈센셥렯렱렯렳렯렱렯롍렯렱렯렳렯렱렯렧V 10111100101111101011110011000000101111001011111010111100110000101011110010111110101111001100000010111100101111101011110011001010100011101011110010001110101111101000111010111100100011101100000010001110101111001000111010111110100011101011110010001110110100111000111010111100100011101011111010001110101111001000111011000000100011101011110010001110101111101000111010111100100011101011011001010110 bcbebcc0bcbebcc2bcbebcc0bcbebcca8ebc8ebe8ebc8ec08ebc8ebe8ebc8ed38ebc8ebe8ebc8ec08ebc8ebe8ebc8eb656

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)