To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雅??????徇??膺?????援?ぜ?溢 100010011110101100111111001111110011111100111111001111110011111110011100011011010011111100111111111001000101111000111111001111110011111100111111001111111000100110000111001111111000001010111010001111111000100011101100 89eb3f3f3f3f3f3f9c6d3f3fe45e3f3f3f3f3f89873f82ba3f88ec
EUC-JP 雅??????徇??膺??濚?Ŧ援?ぜ?溢 10110010111011010011111100111111001111110011111100111111001111111101011111001110001111110011111111100111101111110011111100111111100011111100100110100001001111111000111110101001101011111011000111100111001111111010010010111100001111111011000011101110 b2ed3f3f3f3f3f3fd7ce3f3fe7bf3f3f8fc9a13f8fa9afb1e73fa4bc3fb0ee
UTF-8 雅붞살뎾連곌퇎徇쒏씭膺뚯땔濚밸Ŧ援욆ぜ짰溢 1110100110011011100001011110101110110110100111101110110010000010101101001110101110001110101111101110111110100110100110101110101010110011100011001110110110000111100011101110010110111110100001111110110010010010100011111110110010010100101011011110100010000110101110101110101110011010101011111110101110010101100101001110011010111111100110101110101110110000101110001100010110100110111001101000111110110100111011001001101010000110111000111000000110011100111011001010011110110000111001101011101010100010 e99b85ebb69eec82b4eb8ebeefa69aeab38ced878ee5be87ec928fec94ade886baeb9aafeb9594e6bf9aebb0b8c5a6e68fb4ec9a86e3819ceca7b0e6baa2
UHC 雅붞살뎾連곌퇎徇쒏씭膺뚯땔濚밸Ŧ援욆ぜ짰溢 111001001011101010010100110011101011101111101100100010011001000111100110111001101011000011101010101101111001111111100010110111111001110011100110100111011011111011101011111011001000110011101100101101101010101011100111101110011011100111101011101010001010111011101010101101011001111011101000101010101011110011000010101011101110110011101110 e4ba94cebbec8991e6e6b0eab79fe2df9ce69dbeebec8cecb6aae7b9b9eba8aeeab59ee8aabcc2aeecee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)