What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?扱??!淞る?藥??倚т? 00111111001111110011111110001011100000111000000110101000001111111000100010110101001111110011111110000001010010011001111111000010100000101110100100111111111001010101101000111111001111111001100011011111100001001000010000111111 3f3f3f8b8381a83f88b53f3f81499fc282e93fe55a3f3f98df84843f
EUC-JP ???泣→?扱??!淞る?藥??倚т? 00111111001111110011111110110101111000111010001010101010001111111011000010110111001111110011111110100001101010101101111011000100101001001110101100111111111010011011101100111111001111111101000011100001101001111110010000111111 3f3f3fb5e3a2aa3fb0b73f3fa1aadec4a4eb3fe9bb3f3fd0e1a7e43f
UTF-8 捻꿔끇泣→쨫扱琉껈!淞る뤊藥띲꺁倚т슭 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110011010001001101100011110111110100111100011001110101010111011100010001110111110111100100000011110011010110111100111101110001110000010100010111110101110100100100010101110100010010111101001011110101110011101101100101110101010111010100000011110010110000000100110101101000110000010111011001000101010101101 efa6a4eabf94eb8187e6b3a3e28692eca8abe689b1efa78ceabb88efbc81e6b79ee3828beba48ae897a5eb9db2eaba81e5809ad182ec8aad
UHC 捻꿔끇泣→쨫扱琉껈!淞る뤊藥띲꺁倚т슭 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011101000011100010111010111010010010000011111010011010001110100001111000011110011110101010111010111000111110111010111001011011011110001101111000111000001110101010111010111110111110101100111001001011110110111110 e6f7b2e385bbebe8a1e6a485d0e2eba483e9a3a1e1e7aaeb8fbae5b78de383aaebeface4bdbe

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
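
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become b"?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset of CHARSETS, keyed by charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("る").items():
        print(name, binary, hexadecimal)
```

For example, `encode_row("る", "cp932")` yields the hex string `82e9` and `encode_row("る", "euc_jp")` yields `a4eb`, the same byte pairs that appear for "る" in the SJIS-WIN and EUC-JP rows above.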