Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string, then click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋?????阿??愉↓?濡レ?沃 0011111100111111001111111110100011101000001111110011111100111111001111110011111110001000101000100011111100111111100101101111100110000001101010110011111110010100010001111000001110001100001111111001011110000000 3f3f3fe8e83f3f3f3f3f88a23f3f96f981ab3f9447838c3f9780
EUC-JP ???韋??洹??阿??愉↓?濡レ?沃 00111111001111110011111111110000111010100011111100111111100011111100011110111010001111110011111110110000101001000011111100111111110011001111101110100010101011010011111111000111101010001010010111101100001111111100110111100000 3f3f3ff0ea3f3f8fc7ba3f3fb0a43f3fccfba2ad3fc7a8a5ec3fcde0
UTF-8 捻뀁궠韋귡윀洹잆걶阿숋퐣愉↓뼸濡レ굣沃 111011111010011010100100111010111000000010000001111010101011011010100000111010011001111110001011111010101011011110100001111011001001110010000000111001101011010010111001111011001001111010000110111010101011000110110110111010011001100010111111111011001000100010001011111011011001000010100011111001101000010010001001111000101000011010010011111010111011110010111000111001101011111110100001111000111000001110101100111010101011010110100011111001101011001010000011 efa6a4eb8081eab6a0e99f8beab7a1ec9c80e6b4b9ec9e86eab1b6e998bfec888bed90a3e68489e28693ebbcb8e6bfa1e383aceab5a3e6b283
UHC 捻뀁궠韋귡윀洹잆걶阿숋퐣愉↓뼸濡レ굣沃 1110011011110111101100101110110010000010101100111110101011011111100000101110100110011111100010111110101010110111100111111110001110000001100111001110010010111001100110011110111110111101100011001110101011110000101000011110100110010110101110111110101110100001101010111110110010110001101101111110100010101010 e6f7b2ec82b3eadf82e99f8beab79fe3819ce4b999efbd8ceaf0a1e996bbeba1abecb1b7e8aa

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
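
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the helper name encode_table and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("阿"):
        print(label, bits, hex_string)
```

For the single character 阿, this yields 88a2 in SJIS-WIN, b0a4 in EUC-JP, and e998bf in UTF-8, matching the corresponding bytes in the table rows above.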