To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 阿??姙ф?鍮??蟻ы?亦??阿??姙ч? 10001000101000100011111100111111100110110100101110000100100001100011111111101000010010100011111100111111100010110110000110000100100011010011111110010110100100100011111100111111100010001010001000111111001111111001101101001011100001001000100100111111 88a23f3f9b4b84863fe84a3f3f8b61848d3f96923f3f88a23f3f9b4b84893f
EUC-JP 阿??姙ф?鍮??蟻ы?亦??阿??姙ч? 10110000101001000011111100111111110101011010110010100111111001100011111111101111101010110011111100111111101101011100001010100111111011010011111111001011111100100011111100111111101100001010010000111111001111111101010110101100101001111110100100111111 b0a43f3fd5aca7e63fefab3f3fb5c2a7ed3fcbf23f3fb0a43f3fd5aca7e93f
UTF-8 阿숈옱姙ф쿅鍮삡튃蟻ы뿬亦싲퍠阿숈옱姙ч빓 111010011001100010111111111011001000100010001000111011001001100010110001111001011010011110011001110100011000010011101100101111111000010111101001100011011010111011101100100000101010000111101101100010101000001111101000100111111011101111010001100010111110101110111111101011001110010010111010101001101110110010001011101100101110110110001101101000001110100110011000101111111110110010001000100010001110110010011000101100011110010110100111100110011101000110000111111010111011100110010011 e998bfec8888ec98b1e5a799d184ecbf85e98daeec82a1ed8a83e89fbbd18bebbface4baa6ec8bb2ed8da0e998bfec8888ec98b1e5a799d187ebb993
UHC 阿숈옱姙ф쿅鍮삡튃蟻ы뿬亦싲퍠阿숈옱姙ч빓 111001001011100110011001111011001001111010101100111011001111010110101100111001101011001010011010111010111011100110111011111001001011100110011001111010111111110010101100111011011001011110101100111001101011001010011010111010111011101110010111111001001011100110011001111011001001111010101100111011001111010110101100111010011001010110110111 e4b999ec9eacecf5ace6b29aebb9bbe4b999ebfcaced97ace6b29aebbb97e4b999ec9eacecf5ace995b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)