To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??嚥ヨ?節?ぞ鸚??崖??穩??沃?? 100011001110011000111111001111111001101010001011100000111000100000111111100100001101111100111111100000101011110011101010010111110011111100111111100010100101001000111111001111111110001001110010001111110011111110010111100000000011111100111111 8ce63f3f9a8b83883f90df3f82bcea5f3f3f8a523f3fe2723f3f97803f3f
EUC-JP 梧??嚥ヨ?節?ぞ鸚??崖??穩??沃?? 101110001110100000111111001111111101001111101011101001011110100000111111110000001110000100111111101001001011111011110011110000000011111100111111101100111011001100111111001111111110001111010011001111110011111111001101111000000011111100111111 b8e83f3fd3eba5e83fc0e13fa4bef3c03f3fb3b33f3fe3d33f3fcde03f3f
UTF-8 梧녑뮧嚥ヨ콖節삭ぞ鸚㏆쉰崖꿩엮穩먲슥沃얏닅 111001101010001010100111111010111000010110010001111010111010111010100111111001011001101010100101111000111000001110101000111011001011110110010110111001111010111110000000111011001000001010101101111000111000000110011110111010011011100010011010111000111000111110000110111011001000100110110000111001011011010010010110111010101011111110101001111011001001011110101110111001111010100110101001111010111010100010110010111011001000101010100101111001101011001010000011111011001001011010001111111010111000101110000101 e6a2a7eb8591ebaea7e59aa5e383a8ecbd96e7af80ec82ade3819ee9b89ae38f86ec89b0e5b496eabfa9ec97aee7a9a9eba8b2ec8aa5e6b283ec968feb8b85
UHC 梧녑뮧嚥ヨ콖節삭ぞ鸚㏆쉰崖꿩엮穩먲슥沃얏닅 111001111111110010110011111001011001001010110010111001101011111110101011111010001011000110010000111011111011110110111011111010001010101010111110111001011010010010100111111011111011110110101110111001001111000010110010111001101011111110101011111010001011000110010000111011111011110110111011111010001010101010111110111001101000100010001110 e7fcb3e592b2e6bfabe8b190efbdbbe8aabee5a4a7efbdaee4f0b2e6bfabe8b190efbdbbe8aabee6888e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)