To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??遺???ル???┼???猥??侑 00111111001111110011111111101000111010000011111100111111100010001110001000111111001111110011111110000011100010110011111100111111001111111000010010101001001111110011111100111111111000001100111000111111001111111001100011010000 3f3f3fe8e83f3f88e23f3f3f838b3f3f3f84a93f3f3fe0ce3f3f98d0
EUC-JP ???韋??遺???ル???┼洧??猥??侑 001111110011111100111111111100001110101000111111001111111011000011100100001111110011111100111111101001011110101100111111001111110011111110101000101010111000111111000111101101000011111100111111111000001101000000111111001111111101000011010010 3f3f3ff0ea3f3fb0e43f3f3fa5eb3f3f3fa8ab8fc7b43f3fe0d03f3fd0d2
UTF-8 捻뀁궠韋껃젆遺븐낄曆ル봾柳뺧┼洧꿸틓猥됰쑚侑 111011111010011010100100111010111000000010000001111010101011011010100000111010011001111110001011111010101011101110000011111011001010000010000110111010011000000110111010111010111011100010010000111010111000001010000100111011111010011010001011111000111000001110101011111010111011010010111110111011111010011110001001111010111011101010100111111000101001010010111100111001101011010010100111111010101011111110111000111011011000101110010011111001111000110010100101111010111001000010110000111011001001000110011010111001001011111010010001 efa6a4eb8081eab6a0e99f8beabb83eca086e981baebb890eb8284efa68be383abebb4beefa789ebbaa7e294bce6b4a7eabfb8ed8b93e78ca5eb90b0ec919ae4be91
UHC 捻뀁궠韋껃젆遺븐낄曆ル봾柳뺧┼洧꿸틓猥됰쑚侑 1110011011110111101100101110110010000010101100111110101011011111100000111110010110100000100010011110101110110110101110101110110010110011101001011110011010110111101010111110101110010100100001011110101011110111100101011110111110100110101010111110101011111011101100101110101010111010100000101110100011100101100010011110101110011100101110011110101011100010 e6f7b2ec82b3eadf83e5a089ebb6baecb3a5e6b7abeb9485eaf795efa6abeafbb2eaba82e8e589eb9cb9eae2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)