To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??豫????ぜ膺??瑤??誼 00111111001111110011111110001001101100110011111100111111100110001010110000111111001111110011111100111111100000101011101011100100010111100011111100111111111010101010001000111111001111111000101101100010 3f3f3f89b33f3f98ac3f3f3f3f82bae45e3f3feaa23f3f8b62
EUC-JP 濚??乙??豫??沅?ぜ膺??瑤??誼 1000111111001001101000010011111100111111101100101011010100111111001111111101000010101110001111110011111110001111110001101110100100111111101001001011110011100111101111110011111100111111111101001010010000111111001111111011010111000011 8fc9a13f3fb2b53f3fd0ae3f3f8fc6e93fa4bce7bf3f3ff4a43f3fb5c3
UTF-8 濚욌꼬乙댁젞豫뗪퍔沅좄ぜ膺얘샹瑤녠퀣誼 111001101011111110011010111011001001101010001100111010101011110010101100111001001011100110011001111010111000110010000001111011001010000010011110111010001011000110101011111010111001011110101010111011011000110110010100111001101011001010000101111011001010001010000100111000111000000110011100111010001000011010111010111011001001011010011000111011001000001110111001111001111001000110100100111010111000010110100000111011011000000010100011111010001010101010111100 e6bf9aec9a8ceabcace4b999eb8c81eca09ee8b1abeb97aaed8d94e6b285eca284e3819ce886baec9698ec83b9e791a4eb85a0ed80a3e8aabc
UHC 濚욌꼬乙댁젞豫뗪퍔沅좄ぜ膺얘샹瑤녠퀣誼 1110011110111001100111101110101110110010101111111110101111100000101101001110110010100000100110001110011111100011100010111110101010111011100010111110101010110110101000001110100010101010101111001110101111101100101111101110101010111100101001111110100011111101101100111110101010110011100101111110101111111110 e7b99eebb2bfebe0b4eca098e7e38beabb8beab6a0e8aabcebecbeeabca7e8fdb3eab397ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)