To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?l?誼??貫誼??貫誼??肯宜??柔?? 001111111000001010001100001111111000101101100010001111110011111110001010110100011000101101100010001111110011111110001010110100011000101101100010001111110011111110001101011011011000101101011000001111110011111110001111010111110011111100111111 3f828c3f8b623f3f8ad18b623f3f8ad18b623f3f8d6d8b583f3f8f5f3f3f
EUC-JP 渶l?誼??貫誼??貫誼??肯宜??柔?? 1000111111000111111011011010001111101100001111111011010111000011001111110011111110110100110100111011010111000011001111110011111110110100110100111011010111000011001111110011111110111001110011101011010110111001001111110011111110111101110000000011111100111111 8fc7eda3ec3fb5c33f3fb4d3b5c33f3fb4d3b5c33f3fb9ceb5b93f3fbdc03f3f
UTF-8 渶l쉶誼숁굝貫誼삥굝貫誼삣맫肯宜쇔쳞柔㏓뙕 111001101011100010110110111011111011110110001100111011001000100110110110111010001010101010111100111011001000100010000001111010101011010110011101111010001011001010101011111010001010101010111100111011001000001010100101111010101011010110011101111010001011001010101011111010001010101010111100111011001000001010100011111010111010011110101011111010001000001010101111111001011010111010011100111011001000011110010100111011001011001110011110111001101001111110010100111000111000111110010011111010111001100110010101 e6b8b6efbd8cec89b6e8aabcec8881eab59de8b2abe8aabcec82a5eab59de8b2abe8aabcec82a3eba7abe882afe5ae9cec8794ecb39ee69f94e38f93eb9995
UHC 渶l쉶誼숁굝貫誼삥굝貫誼삣맫肯宜쇔쳞柔㏓뙕 111001111011011110100011111011001001101010001100111010111111111010011001111001101000001010000101110011101011101111101011111111101011101111100110100000101000010111001110101110111110101111111110101110111110010110010000101100111101000011101001111010111111000110111100111001011010101110000100111010101111010110100111111010111000110010011010 e7b7a3ec9a8cebfe99e68285cebbebfebbe68285cebbebfebbe590b3d0e9ebf1bce5ab84eaf5a7eb8c9a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)