What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厄μ????音??鴨??????κ?癲???^ 1001011011101111100000111100101000111111001111110011111100111111100010011011100100111111001111111000101010011011001111110011111100111111001111110011111100111111100000111100100000111111111000011001111100111111001111110011111101011110 96ef83ca3f3f3f3f89b93f3f8a9b3f3f3f3f3f3f83c83fe19f3f3f3f5e
EUC-JP 厄μ?堉??音??鴨??馹???κ?癲???^ 110011001111000110100110110011000011111110001111101101111111110100111111001111111011001010111011001111110011111110110011111110110011111100111111100011111110100110100001001111110011111100111111101001101100101000111111111000101010000100111111001111110011111101011110 ccf1a6cc3f8fb7fd3f3fb2bb3f3fb3fb3f3f8fe9a13f3f3fa6ca3fe2a13f3f3f5e
UTF-8 厄μ쥙堉득뿬音쀬꽀鴨앪벃馹쇗벉硫κ쉽癲띛딅솦^ 1110010110001110100001001100111010111100111011001010010110011001111001011010000010001001111010111001001110011101111010111011111110101100111010011001111110110011111011001000000010101100111010101011110110000000111010011011010010101000111011001001010110101010111010111011001010000011111010011010011010111001111011001000011110010111111010111011001010001001111011111010011110001110110011101011101011101100100010011011110111100111100110011011001011101011100111011001101111101011100101001000010111101100100001101010011001011110 e58e84cebceca599e5a089eb939debbface99fb3ec80aceabd80e9b4a8ec95aaebb283e9a6b9ec8797ebb289efa78ecebaec89bde799b2eb9d9beb9485ec86a65e
UHC 厄μ쥙堉득뿬音쀬꽀鴨앪벃馹쇗벉硫κ쉽癲띛딅솦^ 111001001111100010100101111011001010001010001110111010111011110010110101111001101001011110101100111010111110010110010111111011001000010010010101111001001110010110011101111000101001001110101001111011001111000110111100111001101001001110101100111010111010100110100101111010101011110110110001111011111010011010001101110100011000101011101011100110011001111101011110 e4f8a5eca28eebbcb5e697acebe597ec8495e4e59de293a9ecf1bce693aceba9a5eabdb1efa68dd18aeb999f5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
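
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `to_bit_strings` is hypothetical, and the Python codec names `cp932` (for SJIS-Win) and `cp949` (for UHC) are assumed to be close equivalents of the charsets listed above. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hex) strings.

    Hypothetical helper for illustration; not the converter's own code.
    """
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed Python codec names for the charsets in the table.
for charset in ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"):
    binary, hexadecimal = to_bit_strings("音^", charset)
    print(charset, binary, hexadecimal)
```

For example, `to_bit_strings("音", "utf-8")` yields the hexadecimal string `e99fb3`, the same three bytes that appear for 音 in the UTF-8 row, while ISO-8859-1 cannot represent the character and yields `3f`.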