To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伍??泣??純??阿???夜?????濡?? 10001100110111100011111100111111100010111000001100111111001111111000111110000011001111110011111110001000101000100011111100111111001111111001011011101001001111110011111100111111001111110011111110010100010001110011111100111111 8cde3f3f8b833f3f8f833f3f88a23f3f3f96e93f3f3f3f3f94473f3f
EUC-JP 伍??泣??純??阿???夜??沅??濡?? 101110001110000000111111001111111011010111100011001111110011111110111101111000110011111100111111101100001010010000111111001111110011111111001100111010110011111100111111100011111100011011101001001111110011111111000111101010000011111100111111 b8e03f3fb5e33f3fbde33f3fb0a43f3f3fcceb3f3f8fc6e93f3fc7a83f3f
UTF-8 伍밸씮泣쒙쭕純쏇떊阿쇡쇰퓠夜껊툙沅좑쭪濡㏃땡 111001001011110010001101111010111011000010111000111011001001010010101110111001101011001110100011111011001001001010011001111011001010110110010101111001111011010010010100111011001000111110000111111010111001011010001010111010011001100010111111111011001000011110100001111011001000011110110000111011011001001110100000111001011010010010011100111010101011101110001010111011011000100010011001111001101011001010000101111011001010001010010001111011001010110110101010111001101011111110100001111000111000111110000011111010111001010110100001 e4bc8debb0b8ec94aee6b3a3ec9299ecad95e7b494ec8f87eb968ae998bfec87a1ec87b0ed93a0e5a49ceabb8aed8899e6b285eca291ecadaae6bfa1e38f83eb95a1
UHC 伍밸씮泣쒙쭕純쏇떊阿쇡쇰퓠夜껊툙沅좑쭪濡㏃땡 1110011111101010101110011110101110011101101111111110101111101000100111001110111110100111100011011110001011101101100110111110110110001011101000001110010010111001100110011100111010111100111010111011111110001001111001011010100010000011111010111011100010010000111010101011011010100000111011111010011110011110111010111010000110100111111011001011011010101111 e7eab9eb9dbfebe89cefa78de2ed9bed8ba0e4b999cebcebbf89e5a883ebb890eab6a0efa79eeba1a7ecb6af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)