To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥韋??鷹?????苑??碎k?筌 111001011111000100111111100000010110000111101000111010000011111100111111100100011110100100111111001111110011111100111111001111111000100110010001001111110011111111100001111010101000001010001011001111111110001010100011 e5f13f8161e8e83f3f91e93f3f3f3f3f89913f3fe1ea828b3fe2a3
EUC-JP 褥?‖韋??鷹?????苑??碎k?筌 111010101111001100111111101000011100001011110000111010100011111100111111110000101110101100111111001111110011111100111111001111111011000111110001001111110011111111100010111011001010001111101011001111111110010010100101 eaf33fa1c2f0ea3f3fc2eb3f3f3f3f3fb1f13f3fe2eca3eb3fe4a5
UTF-8 褥띕∥韋뤸꼷鷹껊젧醴븐뼚苑섋첑碎k궙筌 111010001010010010100101111010111001110110010101111000101000100010100101111010011001111110001011111010111010010010111000111010101011110010110111111010011011011110111001111010101011101110001010111011001010000010100111111011111010011010110111111010111011100010010000111010111011110010011010111010001000101110010001111011001000010010001011111011001011001010010001111001111010001010001110111011111011110110001011111010101011011010011001111001111010110110001100 e8a4a5eb9d95e288a5e99f8beba4b8eabcb7e9b7b9eabb8aeca0a7efa6b7ebb890ebbc9ae88b91ec848becb291e7a28eefbd8beab699e7ad8c
UHC 褥띕∥韋뤸꼷鷹껊젧醴븐뼚苑섋첑碎k궙筌 1110100110110011101101101110101110100001101010111110101011011111100011111110011010000100100011111110101111101101100000111110101110100000100111111110011111100100101110101110110010010110101000001110101010111101100110001110100010101010100111101110000111101111101000111110101110000010101011101110111110100111 e9b3b6eba1abeadf8fe6848febed83eba09fe7e4baec96a0eabd98e8aa9ee1efa3eb82aeefa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)