What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???議?????????????????瀛 001111110011111100111111100010110110001100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111110000001101001 3f3f3f8b633f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fe069
EUC-JP ???議??????????????瑗??瀛 0011111100111111001111111011010111000100001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001111110011001100000000111111001111111101111111001010 3f3f3fb5c43f3f3f3f3f3f3f3f3f3f3f3f3f3f8fccc03f3fdfca
UTF-8 溜삠뀛議덈졎栒붽섹栒붼젞栒붿죭溜삠뀛瑗썬뀛瀛 111011111010011110001011111011001000001010100000111010111000000010011011111010001010110110110000111010111000110110001000111011001010000110001110111001101010000010010010111010111011011010111101111011001000010010111001111001101010000010010010111010111011011010111100111011001010000010011110111001101010000010010010111010111011011010111111111011001010001110101101111011111010011110001011111011001000001010100000111010111000000010011011111001111001000110010111111011001000110110101100111010111000000010011011111001111000000010011011 efa78bec82a0eb809be8adb0eb8d88eca18ee6a092ebb6bdec84b9e6a092ebb6bceca09ee6a092ebb6bfeca3adefa78bec82a0eb809be79197ec8daceb809be7809b
UHC 溜삠뀛議덈졎栒붽섹栒붼젞栒붿죭溜삠뀛瑗썬뀛瀛 1110101011111110101110111110001110000101100101001110110010100001100010001110101110100000101110111110001011100011100101001110101010111100101111011110001011100011100101001110100110100000100110001110001011100011100101001110110010100001100010001110101011111110101110111110001110000101100101001110101010111100101111011110001110000101100101001110011110111010 eafebbe38594eca188eba0bbe2e394eabcbde2e394e9a098e2e394eca188eafebbe38594eabcbde38594e7ba

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
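
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about the closest Python equivalents, and the helper names `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest match to SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's name for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset; returns {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("議").items():
        print(label, binary, hexadecimal)
```

For the single character 議, this sketch yields 3f for ISO-8859-1, 8b63 for SJIS-WIN, b5c4 for EUC-JP and e8adb0 for UTF-8, which agrees with the byte sequences shown for that character in the table.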