To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???容??松??[???容??松??[^ 00111111001111110011111110010111011001010011111100111111100011111011110000111111001111110101101100111111001111110011111110010111011001010011111100111111100011111011110000111111001111110101101101011110 3f3f3f97653f3f8fbc3f3f5b3f3f3f97653f3f8fbc3f3f5b5e
EUC-JP ???容??松??[???容??松??[^ 00111111001111110011111111001101110001100011111100111111101111101011111000111111001111110101101100111111001111110011111111001101110001100011111100111111101111101011111000111111001111110101101101011110 3f3f3fcdc63f3fbebe3f3f5b3f3f3fcdc63f3fbebe3f3f5b5e
UTF-8 醴ㅲ닜容꾥넠松앶퍍[醴ㅲ닜容꾥넠松앶퍍[^ 111011111010011010110111111000111000010110110010111010111000101110011100111001011010111010111001111010101011111010100101111010111000010010100000111001101001110110111110111011001001010110110110111011011000110110001101010110111110111110100110101101111110001110000101101100101110101110001011100111001110010110101110101110011110101010111110101001011110101110000100101000001110011010011101101111101110110010010101101101101110110110001101100011010101101101011110 efa6b7e385b2eb8b9ce5aeb9eabea5eb84a0e69dbeec95b6ed8d8d5befa6b7e385b2eb8b9ce5aeb9eabea5eb84a0e69dbeec95b6ed8d8d5b5e
UHC 醴ㅲ닜容꾥넠松앶퍍[醴ㅲ닜容꾥넠松앶퍍[^ 111001111110010010100100111000101000100010011101111010011011101110000100111010001000011010100100111000011110011010011101111010011011101110000100010110111110011111100100101001001110001010001000100111011110100110111011100001001110100010000110101001001110000111100110100111011110100110111011100001000101101101011110 e7e4a4e2889de9bb84e886a4e1e69de9bb845be7e4a4e2889de9bb84e886a4e1e69de9bb845b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)