To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???伊??諭??熬??愉??揄щ?裔?~ 0011111100111111001111111000100011001001001111110011111110010111010000000011111100111111111000001001001000111111001111111001011011111001001111110011111110011101100010011000010010001011001111111110010111100001001111111000000101100000 3f3f3f88c93f3f97403f3fe0923f3f96f93f3f9d89848b3fe5e13f8160
EUC-JP ???伊??諭??熬??愉??揄щ?裔?〜 0011111100111111001111111011000011001011001111110011111111001101101000010011111100111111110111111111001000111111001111111100110011111011001111110011111111011001111010011010011111101011001111111110101011100011001111111010000111000001 3f3f3fb0cb3f3fcda13f3fdff23f3fccfb3f3fd9e9a7eb3feae33fa1c1
UTF-8 嶺뚳퐣伊싨룚諭꾠럷熬곥굦愉띸춯揄щ굦裔꾩~ 1110111110100110101010111110101110011010101100111110110110010000101000111110010010111100100010101110110010001011101010001110101110100011100110101110100010101011101011011110101010111110101000001110101110011111101101111110011110000110101011001110101010110011101001011110101010110101101001101110011010000100100010011110101110011101101110001110110010110110101011111110011010001111100001001101000110001001111010101011010110100110111010001010001110010100111010101011111010101001111011111011110110011110 efa6abeb9ab3ed90a3e4bc8aec8ba8eba39ae8abadeabea0eb9fb7e786aceab3a5eab5a6e68489eb9db8ecb6afe68f84d189eab5a6e8a394eabea9efbd9e
UHC 嶺뚳퐣伊싨룚諭꾠럷熬곥굦愉띸춯揄щ굦裔꾩~ 111001111010110110001100111011111011110110001100111011001010010110011010111001101000111110010110111010111011000110000100111000111000111010010110111010001010001010000001111000111000001010001100111010101111000010001101111001111010110110001100111010101111000110101100111010111000001010001100111001111110000010000100111011001010001010100110 e7ad8cefbd8ceca59ae68f96ebb184e38e96e8a281e3828ceaf08de7ad8ceaf1aceb828ce7e084eca2a6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)