To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????音??鴨??壹???κ?癲???^ 00111111001111110011111100111111001111110011111110001001101110010011111100111111100010101001101100111111001111111001101011100011001111110011111100111111100000111100100000111111111000011001111100111111001111110011111101011110 3f3f3f3f3f3f89b93f3f8a9b3f3f9ae33f3f3f83c83fe19f3f3f3f5e
EUC-JP ???堉??音??鴨??壹???κ?癲???^ 001111110011111100111111100011111011011111111101001111110011111110110010101110110011111100111111101100111111101100111111001111111101010011100101001111110011111100111111101001101100101000111111111000101010000100111111001111110011111101011110 3f3f3f8fb7fd3f3fb2bb3f3fb3fb3f3fd4e53f3f3fa6ca3fe2a13f3f3f5e
UTF-8 樂띾맩堉득뿬音쀬꽀鴨앪몼壹방벉硫κ쉽癲띛딅솦^ 111011111010011010111111111010111001110110111110111010111010011110101001111001011010000010001001111010111001001110011101111010111011111110101100111010011001111110110011111011001000000010101100111010101011110110000000111010011011010010101000111011001001010110101010111010111010101010111100111001011010001110111001111010111011000010101001111010111011001010001001111011111010011110001110110011101011101011101100100010011011110111100111100110011011001011101011100111011001101111101011100101001000010111101100100001101010011001011110 efa6bfeb9dbeeba7a9e5a089eb939debbface99fb3ec80aceabd80e9b4a8ec95aaebaabce5a3b9ebb0a9ebb289efa78ecebaec89bde799b2eb9d9beb9485ec86a65e
UHC 樂띾맩堉득뿬音쀬꽀鴨앪몼壹방벉硫κ쉽癲띛딅솦^ 111010001111100110001101111010111001000010110001111010111011110010110101111001101001011110101100111010111110010110010111111011001000010010010101111001001110010110011101111000101001000110100001111011001110110010111001111001101001001110101100111010111010100110100101111010101011110110110001111011111010011010001101110100011000101011101011100110011001111101011110 e8f98deb90b1ebbcb5e697acebe597ec8495e4e59de291a1ececb9e693aceba9a5eabdb1efa68dd18aeb999f5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
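
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charset labels above (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be how the table above handles them as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("音^").items():
        print(name, bits, hex_string)
```

For the single character 音, this sketch yields 89b9 in CP932, b2bb in EUC-JP, and e99fb3 in UTF-8, which matches the byte sequences for that character in the rows above. In ISO-8859-1 the character is unencodable and comes out as 3f.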