To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??艤??恝逸??攸??豫?,???喩?? 111000111010000000111111001111111110010001111110001111110011111111111010101111001000100011101101001111110011111110011101101111110011111100111111100110001010110000111111100000010100001100111111001111110011111110011010011001110011111100111111 e3a03f3fe47e3f3ffabc88ed3f3f9dbf3f3f98ac3f81433f3f3f9a673f3f
EUC-JP 罌??艤??恝逸??攸??豫?,???喩?? 11100110101000100011111100111111111001111101111100111111001111111000111110111101111001111011000011101111001111110011111111011010110000010011111100111111110100001010111000111111101000011010010000111111001111110011111111010011110010000011111100111111 e6a23f3fe7df3f3f8fbde7b0ef3f3fdac13f3fd0ae3fa1a43f3f3fd3c83f3f
UTF-8 罌살슃艤욕눢恝逸썸뀆攸됱뒴豫뗫,柳좂독喩쏆뒾 111001111011110110001100111011001000001010110100111011001000101010000011111010001000100110100100111011001001101010010101111010111000100010100010111001101000000110011101111010011000000010111000111011001000110110111000111010111000000010000110111001101001010010111000111010111001000010110001111010111001001010110100111010001011000110101011111010111001011110101011111011111011110010001100111011111010011110001001111011001010001010000010111010111000111110000101111001011001011010101001111011001000111110000110111010111001001010111110 e7bd8cec82b4ec8a83e889a4ec9a95eb88a2e6819de980b8ec8db8eb8086e694b8eb90b1eb92b4e8b1abeb97abefbc8cefa789eca282eb8f85e596a9ec8f86eb92be
UHC 罌살슃艤욕눢恝逸썸뀆攸됱뒴豫뗫,柳좂독喩쏆뒾 1110010110100010101110111110110010011010100101011110101111111010101111111110010110000111101110011100111010111111111011001110111110111101111001101000010110000010111010101111001010001001111011001000101010101101111001111110001110001011111010111010001110101100111010101111011110100000111001111011010110110110111010101110011110011011111011001000101010110100 e5a2bbec9a95ebfabfe587b9cebfecefbde68582eaf289ec8aade7e38beba3aceaf7a0e7b5b6eae79bec8ab4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)