What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????攸?き檍??鍮??邑??五 00111111001111110011111100111111001111110011111110011101101111110011111110000010101010111001111011111000001111110011111111101000010010100011111100111111100101110101011100111111001111111000110011011100 3f3f3f3f3f3f9dbf3f82ab9ef83f3fe84a3f3f97573f3f8cdc
EUC-JP ???沅??攸?き檍??鍮??邑??五 001111110011111100111111100011111100011011101001001111110011111111011010110000010011111110100100101011011101110011111010001111110011111111101111101010110011111100111111110011011011100000111111001111111011100011011110 3f3f3f8fc6e93f3fdac13fa4addcfa3f3fefab3f3fcdb83f3fb8de
UTF-8 嶺뚮뿭沅섋땻攸곷き檍됰챷鍮쇤턁邑뀁뵦五 111011111010011010101011111010111001101010101110111010111011111110101101111001101011001010000101111011001000010010001011111010111001010110111011111001101001010010111000111010101011001110110111111000111000000110001101111001101010101010001101111010111001000010110000111011001011000110110111111010011000110110101110111011001000011110100100111011011000010010000001111010011000001010010001111010111000000010000001111010111011010110100110111001001011101010010100 efa6abeb9aaeebbfade6b285ec848beb95bbe694b8eab3b7e3818de6aa8deb90b0ecb1b7e98daeec87a4ed8481e98291eb8081ebb5a6e4ba94
UHC 嶺뚮뿭沅섋땻攸곷き檍됰챷鍮쇤턁邑뀁뵦五 1110011110101101100011001110101110010111101011011110101010110110100110001110100010001011100100011110101011110010100000011110101110101010101011011110010111100101100010011110101110101010100001001110101110111001101111001110100110110101100111011110101111101001101100101110110010010100101001011110011111101001 e7ad8ceb97adeab698e88b91eaf281ebaaade5e589ebaa84ebb9bce9b59debe9b2ec94a5e7e9

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
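
The conversion shown in the table can be approximated offline with a minimal Python sketch. This is not the tool's own implementation. The codec names are assumptions: Python's `cp932` stands in for SJIS-WIN, `euc_jp` for EUC-JP, and `cp949` for UHC, and these may differ from the tool's charsets for some vendor-specific characters. The function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is why runs of 3f appear in the rows above.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


# Print one row per charset for a sample character.
for label, bits, hex_string in encode_table("き"):
    print(label, bits, hex_string)
```

For the single character き this gives 82ab in SJIS-WIN, a4ad in EUC-JP, and e3818d in UTF-8, byte sequences that also appear in the corresponding rows of the table.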