To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???znf???zn^}Y???znf???zn^}bE 0011111100111111001111110111101001101110011001100011111100111111001111110111101001101110010111100111110101011001001111110011111100111111011110100110111001100110001111110011111100111111011110100110111001011110011111010110001001000101 3f3f3f7a6e663f3f3f7a6e5e7d593f3f3f7a6e663f3f3f7a6e5e7d6245
SJIS-WIN 翹るしznf翹るしzn^}Y翹るしznf翹るしzn^}bE 1110001111001001100000101110100110000010101101010111101001101110011001101110001111001001100000101110100110000010101101010111101001101110010111100111110101011001111000111100100110000010111010011000001010110101011110100110111001100110111000111100100110000010111010011000001010110101011110100110111001011110011111010110001001000101 e3c982e982b57a6e66e3c982e982b57a6e5e7d59e3c982e982b57a6e66e3c982e982b57a6e5e7d6245
EUC-JP 翹るしznf翹るしzn^}Y翹るしznf翹るしzn^}bE 1110011011001011101001001110101110100100101101110111101001101110011001101110011011001011101001001110101110100100101101110111101001101110010111100111110101011001111001101100101110100100111010111010010010110111011110100110111001100110111001101100101110100100111010111010010010110111011110100110111001011110011111010110001001000101 e6cba4eba4b77a6e66e6cba4eba4b77a6e5e7d59e6cba4eba4b77a6e66e6cba4eba4b77a6e5e7d6245
UTF-8 翹るしznf翹るしzn^}Y翹るしznf翹るしzn^}bE 1110011110111111101110011110001110000010100010111110001110000001100101110111101001101110011001101110011110111111101110011110001110000010100010111110001110000001100101110111101001101110010111100111110101011001111001111011111110111001111000111000001010001011111000111000000110010111011110100110111001100110111001111011111110111001111000111000001010001011111000111000000110010111011110100110111001011110011111010110001001000101 e7bfb9e3828be381977a6e66e7bfb9e3828be381977a6e5e7d59e7bfb9e3828be381977a6e66e7bfb9e3828be381977a6e5e7d6245
UHC 翹るしznf翹るしzn^}Y翹るしznf翹るしzn^}bE 1100111011101110101010101110101110101010101101110111101001101110011001101100111011101110101010101110101110101010101101110111101001101110010111100111110101011001110011101110111010101010111010111010101010110111011110100110111001100110110011101110111010101010111010111010101010110111011110100110111001011110011111010110001001000101 ceeeaaebaab77a6e66ceeeaaebaab77a6e5e7d59ceeeaaebaab77a6e66ceeeaaebaab77a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)