What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??邑??猥??裕??矜伊??認 001111110011111100111111111010001110100000111111001111111001011101010111001111110011111111100000110011100011111100111111100101110101010000111111001111111110000111100000100010001100100100111111001111111001010001000110 3f3f3fe8e83f3f97573f3fe0ce3f3f97543f3fe1e088c93f3f9446
EUC-JP ???韋??邑??猥??裕??矜伊??認 001111110011111100111111111100001110101000111111001111111100110110111000001111110011111111100000110100000011111100111111110011011011010100111111001111111110001011100010101100001100101100111111001111111100011110100111 3f3f3ff0ea3f3fcdb83f3fe0d03f3fcdb53f3fe2e2b0cb3f3fc7a7
UTF-8 捻뀁궠韋껃젆邑룔렆猥됰굝裕드슫矜伊싨궇認 111011111010011010100100111010111000000010000001111010101011011010100000111010011001111110001011111010101011101110000011111011001010000010000110111010011000001010010001111010111010001110010100111010111010000010000110111001111000110010100101111010111001000010110000111010101011010110011101111010001010001110010101111010111001001110011100111011001000101010101011111001111001111110011100111001001011110010001010111011001000101110101000111010101011011010000111111010001010101010001101 efa6a4eb8081eab6a0e99f8beabb83eca086e98291eba394eba086e78ca5eb90b0eab59de8a395eb939cec8aabe79f9ce4bc8aec8ba8eab687e8aa8d
UHC 捻뀁궠韋껃젆邑룔렆猥됰굝裕드슫矜伊싨궇認 11100110111101111011001011101100100000101011001111101010110111111000001111100101101000001000100111101011111010011011011111100011100011101010000011101000111001011000100111101011100000101000010111101011101011101011010111100101100110101011010011010000111010001110110010100101100110101110011010000010101000001110110011100011 e6f7b2ec82b3eadf83e5a089ebe9b7e38ea0e8e589eb8285ebaeb5e59ab4d0e8eca59ae682a0ece3

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
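
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation, which is not shown on the page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's own and are assumed here to correspond to the charsets listed; the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("認").items():
        print(label, binary, hexadecimal)
```

For the single character 認, this sketch yields `e8aa8d` for UTF-8, `9446` for SJIS-WIN, and `c7a7` for EUC-JP, matching the final bytes of the corresponding rows in the table.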