To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伍??泣??純??阿??異??濡レ?沃 100011001101111000111111001111111000101110000011001111110011111110001111100000110011111100111111100010001010001000111111001111111000100011011001001111110011111110010100010001111000001110001100001111111001011110000000 8cde3f3f8b833f3f8f833f3f88a23f3f88d93f3f9447838c3f9780
EUC-JP 伍??泣??純??阿??異??濡レ?沃 101110001110000000111111001111111011010111100011001111110011111110111101111000110011111100111111101100001010010000111111001111111011000011011011001111110011111111000111101010001010010111101100001111111100110111100000 b8e03f3fb5e33f3fbde33f3fb0a43f3fb0db3f3fc7a8a5ec3fcde0
UTF-8 伍밸씮泣쒙쭕純쏇떊阿숈눘異룬뼸濡レ굣沃 111001001011110010001101111010111011000010111000111011001001010010101110111001101011001110100011111011001001001010011001111011001010110110010101111001111011010010010100111011001000111110000111111010111001011010001010111010011001100010111111111011001000100010001000111010111000100010011000111001111001010110110000111010111010001110101100111010111011110010111000111001101011111110100001111000111000001110101100111010101011010110100011111001101011001010000011 e4bc8debb0b8ec94aee6b3a3ec9299ecad95e7b494ec8f87eb968ae998bfec8888eb8898e795b0eba3acebbcb8e6bfa1e383aceab5a3e6b283
UHC 伍밸씮泣쒙쭕純쏇떊阿숈눘異룬뼸濡レ굣沃 1110011111101010101110011110101110011101101111111110101111101000100111001110111110100111100011011110001011101101100110111110110110001011101000001110010010111001100110011110110010000111101100011110110010110110101101111110100110010110101110111110101110100001101010111110110010110001101101111110100010101010 e7eab9eb9dbfebe89cefa78de2ed9bed8ba0e4b999ec87b1ecb6b7e996bbeba1abecb1b7e8aa

SJIS-win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
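
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: it assumes that the page's charset labels correspond to the Python codecs "iso-8859-1", "cp932", "euc_jp", "utf-8" and "cp949" (for UHC), and that characters a charset cannot represent are replaced with "?" (0x3f), as the 3f runs in the table suggest. The names CODECS, encode_row and convert are hypothetical and exist only for this illustration.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("レ").items():
        print(label, bits, hex_string)
```

For the katakana character レ, this sketch gives 838c in SJIS-WIN, a5ec in EUC-JP and e383ac in UTF-8, the same byte sequences that appear for that character in the table rows above, and 3f in ISO-8859-1, which has no Japanese characters.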