To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??意??議??艶k?悠??矣??? 1110101001011111001111110011111110001000110100110011111100111111100010110110001100111111001111111000100110010000100000101000101100111111100101110100100100111111001111111110000111100001001111110011111100111111 ea5f3f3f88d33f3f8b633f3f8990828b3f97493f3fe1e13f3f3f
EUC-JP 鸚??意??議??艶k?悠??矣??孼 11110011110000000011111100111111101100001101010100111111001111111011010111000100001111110011111110110001111100001010001111101011001111111100110110101010001111110011111111100010111000110011111100111111100011111011101011000011 f3c03f3fb0d53f3fb5c43f3fb1f0a3eb3fcdaa3f3fe2e33f3f8fbac3
UTF-8 鸚쒓퍓意쎿룚議용옜艶k챷悠뉐땔矣곗뒻孼 111010011011100010011010111011001001001010010011111011011000110110010011111001101000010010001111111011001000111010111111111010111010001110011010111010001010110110110000111011001001101010101001111011001001100010011100111010001000100110110110111011111011110110001011111011001011000110110111111001101000001010100000111010111000100110010000111010111001010110010100111001111001111110100011111010101011001110010111111010111001001010111011111001011010110110111100 e9b89aec9293ed8d93e6848fec8ebfeba39ae8adb0ec9aa9ec989ce889b6efbd8becb1b7e682a0eb8990eb9594e79fa3eab397eb92bbe5adbc
UHC 鸚쒓퍓意쎿룚議용옜艶k챷悠뉐땔矣곗뒻孼 1110010110100100100111001110101010111011100010101110101111110010100110111110011010001111100101101110110010100001101111111110101110111111101111111110011011111101101000111110101110101010100001001110101011101101100001111110010110110110101010101110101111111000101100001110110010001010101100011110010111101101 e5a49ceabb8aebf29be68f96eca1bfebbfbfe6fda3ebaa84eaed87e5b6aaebf8b0ec8ab1e5ed

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
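
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the mapping from the table's labels to Python codec names (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is assumed to match the 3f bytes seen in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


# Print one row per charset for a single sample character.
for label, codec in CHARSETS.items():
    bits, hex_string = encode_to_bits("意", codec)
    print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.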