What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??·nf??·n^}Y??·nf??·n^}bE 00111111001111111011011101101110011001100011111100111111101101110110111001011110011111010101100100111111001111111011011101101110011001100011111100111111101101110110111001011110011111010110001001000101 3f3fb76e663f3fb76e5e7d593f3fb76e663f3fb76e5e7d6245
SJIS-WIN 絶??nf絶??n^}Y絶??nf絶??n^}bE 1001000011100010001111110011111101101110011001101001000011100010001111110011111101101110010111100111110101011001100100001110001000111111001111110110111001100110100100001110001000111111001111110110111001011110011111010110001001000101 90e23f3f6e6690e23f3f6e5e7d5990e23f3f6e6690e23f3f6e5e7d6245
EUC-JP 絶??nf絶??n^}Y絶??nf絶??n^}bE 1100000011100100001111110011111101101110011001101100000011100100001111110011111101101110010111100111110101011001110000001110010000111111001111110110111001100110110000001110010000111111001111110110111001011110011111010110001001000101 c0e43f3f6e66c0e43f3f6e5e7d59c0e43f3f6e66c0e43f3f6e5e7d6245
UTF-8 絶뚨·nf絶뚨·n^}Y絶뚨·nf絶뚨·n^}bE 111001111011010110110110111010111001101010101000110000101011011101101110011001101110011110110101101101101110101110011010101010001100001010110111011011100101111001111101010110011110011110110101101101101110101110011010101010001100001010110111011011100110011011100111101101011011011011101011100110101010100011000010101101110110111001011110011111010110001001000101 e7b5b6eb9aa8c2b76e66e7b5b6eb9aa8c2b76e5e7d59e7b5b6eb9aa8c2b76e66e7b5b6eb9aa8c2b76e5e7d6245
UHC 絶뚨·nf絶뚨·n^}Y絶뚨·nf絶뚨·n^}bE 11101111101111101000110011100111101000011010010001101110011001101110111110111110100011001110011110100001101001000110111001011110011111010101100111101111101111101000110011100111101000011010010001101110011001101110111110111110100011001110011110100001101001000110111001011110011111010110001001000101 efbe8ce7a1a46e66efbe8ce7a1a46e5e7d59efbe8ce7a1a46e66efbe8ce7a1a46e5e7d6245

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
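
The conversion in the table can be approximated offline with a short Python sketch. It is not this page's actual implementation: the codec names (cp932 for SJIS-Win, euc_jp, cp949 for UHC) are assumptions about which Python codecs correspond to the charsets listed, and the helper name to_bit_strings is made up for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes seen in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string).

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "絶"  # any character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Results may differ from the table for some inputs, since the replacement behavior and the exact charset variants used by the page are not documented here.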