To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?音??奄??乳??魏???κ? 00111111001111110011111110001011100000111000000110101000001111111000100110111001001111110011111110001001100000100011111100111111100100111111101100111111001111111110100110110000001111110011111100111111100000111100100000111111 3f3f3f8b8381a83f89b93f3f89823f3f93fb3f3fe9b03f3f3f83c83f
EUC-JP ???泣→?音??奄??乳??魏???κ? 00111111001111110011111110110101111000111010001010101010001111111011001010111011001111110011111110110001111000100011111100111111110001101111110100111111001111111111001010110010001111110011111100111111101001101100101000111111 3f3f3fb5e3a2aa3fb2bb3f3fb1e23f3fc6fd3f3ff2b23f3f3fa6ca3f
UTF-8 捻꿔끇泣→쨫音쀬젘奄멸낯乳면쪛魏껎돪若κ퓚 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110100110011111101100111110110010000000101011001110110010100000100110001110010110100101100001001110101110101001101110001110101110000010101011111110010010111001101100111110101110101001101101001110110010101010100110111110100110101101100011111110101010111011100011101110101110001111101010101110111110100101101101001100111010111010111011011001001110011010 efa6a4eabf94eb8187e6b3a3e28692eca8abe99fb3ec80aceca098e5a584eba9b8eb82afe4b9b3eba9b4ecaa9be9ad8feabb8eeb8faaefa5b4cebaed939a
UHC 捻꿔끇泣→쨫音쀬젘奄멸낯乳면쪛魏껎돪若κ퓚 111001101111011110110010111000111000010110111011111010111110100010100001111001101010010010000101111010111110010110010111111011001010000010010100111001011111001010111000111010101011001110111000111010101110000110111000111010011010010110010100111010101110000010000011111011011000100110101101111001011010111010100101111010101011111110000101 e6f7b2e385bbebe8a1e6a485ebe597eca094e5f2b8eab3b8eae1b8e9a594eae083ed89ade5aea5eabf85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)