To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??揖??筍ル?瑤??違??諛??鴉 100110001101101000111111001111111001011101001011001111110011111111100010101000011000001110001011001111111110101010100010001111110011111110001000111000010011111100111111111001101000011100111111001111111110100111101011 98da3f3f974b3f3fe2a1838b3feaa23f3f88e13f3fe6873f3fe9eb
EUC-JP 俑??揖??筍ル?瑤??違??諛??鴉 110100001101110000111111001111111100110110101100001111110011111111100100101000111010010111101011001111111111010010100100001111110011111110110000111000110011111100111111111010111110011100111111001111111111001011101101 d0dc3f3fcdac3f3fe4a3a5eb3ff4a43f3fb0e33f3febe73f3ff2ed
UTF-8 俑앹늾揖답풚筍ル솿瑤녹쉷違끺벧諛몄뒧鴉 111001001011111110010001111011001001010110111001111010111000101010111110111001101000111110010110111010111000101110110101111011011001001010011010111001111010110110001101111000111000001110101011111011001000011010111111111001111001000110100100111010111000010110111001111011001000100110110111111010011000000110010101111010111000000110111010111010111011001010100111111010001010101110011011111010111010101010000100111010111001001010100111111010011011010010001001 e4bf91ec95b9eb8abee68f96eb8bb5ed929ae7ad8de383abec86bfe791a4eb85b9ec89b7e98195eb81baebb2a7e8ab9bebaa84eb92a7e9b489
UHC 俑앹늾揖답풚筍ル솿瑤녹쉷違끺벧諛몄뒧鴉 1110100110110101100111011110110010001000100001111110101111100111101101001110010010111110100111011110001011101100101010111110101110011001101100111110100011111101101100111110110010011010100011011110101011011110100001011110010010111010101001101110101110110000101110001110110010001010101000101110010010111100 e9b59dec8887ebe7b4e4be9de2ecabeb99b3e8fdb3ec9a8deade85e4baa6ebb0b8ec8aa2e4bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)