To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 燿??節??曜???ο?穩℡?節э?奧 11100000101000000011111100111111100100001101111100111111001111111001011101101010001111110011111100111111100000111100110100111111111000100111001010000111100001000011111110010000110111111000010010001111001111111001101011111010 e0a03f3f90df3f3f976a3f3f3f83cd3fe27287843f90df848f3f9afa
EUC-JP 燿??節??曜??旿ο?穩??節э?奧 1110000010100010001111110011111111000000111000010011111100111111110011011100101100111111001111111000111111000001111101001010011011001111001111111110001111010011001111110011111111000000111000011010011111101111001111111101010011111100 e0a23f3fc0e13f3fcdcb3f3f8fc1f4a6cf3fe3d33f3fc0e1a7ef3fd4fc
UTF-8 燿쒏벦節쏙쉭曜곤슁旿ο쉿穩℡츣節э슁奧 11100111100001111011111111101100100100101000111111101011101100101010011011100111101011111000000011101100100011111001100111101100100010011010110111100110100110111001110011101010101100111010010011101100100010101000000111100110100101111011111111001110101111111110110010001001101111111110011110101001101010011110001010000100101000011110110010111000101000111110011110101111100000001101000110001101111011001000101010000001111001011010010110100111 e787bfec928febb2a6e7af80ec8f99ec89ade69b9ceab3a4ec8a81e697bfcebfec89bfe7a9a9e284a1ecb8a3e7af80d18dec8a81e5a5a7
UHC 燿쒏벦節쏙쉭曜곤슁旿ο쉿穩℡츣節э슁奧 1110100011111100100111001110011010010011101111101110111110111101101111011110111110111101101011011110100011111000101100001110111110111101101100111110011111111010101001011110111110111101101100101110100010110001101000101110010110101110100110101110111110111101101011001110111110111101101100111110011111110011 e8fc9ce693beefbdbdefbdade8f8b0efbdb3e7faa5efbdb2e8b1a2e5ae9aefbdacefbdb3e7f3

SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
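
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the names `CHARSETS`, `encode_bits`, and `convert` are hypothetical, the codec names are Python's standard ones, and mapping UHC to `cp949` and SJIS-WIN to `cp932` is an assumption about what the page means by those labels. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# UHC -> cp949 and SJIS-WIN -> cp932 are assumptions, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, (binary, hexa) in convert("あ").items():
        print(label, binary, hexa)
```

Because the same text yields different byte sequences in each charset, reading bytes with the wrong charset produces unrelated characters, so the input encoding should always be known before converting.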