To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弱??節わ?節←?絶??裔э?節o????^ 1000111011100011001111110011111110010000110111111000001011101101001111111001000011011111100000011010100100111111100100001110001000111111001111111110010111100001100001001000111100111111100100001101111110000010100011110011111100111111001111110011111101011110 8ee33f3f90df82ed3f90df81a93f90e23f3fe5e1848f3f90df828f3f3f3f3f5e
EUC-JP 弱??節わ?節←?絶??裔э?節o????^ 1011110011100101001111110011111111000000111000011010010011101111001111111100000011100001101000101010101100111111110000001110010000111111001111111110101011100011101001111110111100111111110000001110000110100011111011110011111100111111001111110011111101011110 bce53f3fc0e1a4ef3fc0e1a2ab3fc0e43f3feae3a7ef3fc0e1a3ef3f3f3f3f5e
UTF-8 弱녻궗節わ숲節←닇絶뚦컛裔э쉠節o숯連먲풄^ 111001011011110010110001111010111000010110111011111010101011011010010111111001111010111110000000111000111000001010001111111011001000100010110010111001111010111110000000111000101000011010010000111010111000101110000111111001111011010110110110111010111001101010100110111011001011101110011011111010001010001110010100110100011000110111101100100010011010000011100111101011111000000011101111101111011000111111101100100010001010111111101111101001101001101011101011101010001011001011101101100100101000010001011110 e5bcb1eb85bbeab697e7af80e3828fec88b2e7af80e28690eb8b87e7b5b6eb9aa6ecbb9be8a394d18dec89a0e7af80efbd8fec88afefa69aeba8b2ed92845e
UHC 弱녻궗節わ숲節←닇絶뚦컛裔э쉠節o숯連먲풄^ 11100101101100001000011011101000100000101010110011101111101111011010101011101111101111011010001111101111101111011010000111100111100010001001000011101111101111101000110011100101101100001000011011100111111000001010110011101111101111011010101011101111101111011010001111101111101111011010000111100110111001101001000011101111101111101000110001011110 e5b086e882acefbdaaefbda3efbda1e78890efbe8ce5b086e7e0acefbdaaefbda3efbda1e6e690efbe8c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)