To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
SJIS-WIN 螯、閠頚nf螯、閠頚n^}Y螯、閠頚nf螯、閠頚n^}bE 1110010110100110101001001110100010000000100011000111101001101110011001101110010110100110101001001110100010000000100011000111101001101110010111100111110101011001111001011010011010100100111010001000000010001100011110100110111001100110111001011010011010100100111010001000000010001100011110100110111001011110011111010110001001000101 e5a6a4e8808c7a6e66e5a6a4e8808c7a6e5e7d59e5a6a4e8808c7a6e66e5a6a4e8808c7a6e5e7d6245
EUC-JP 螯、閠頚nf螯、閠頚n^}Y螯、閠頚nf螯、閠頚n^}bE 111010101010100010001110101001001110111111100000101101111101101101101110011001101110101010101000100011101010010011101111111000001011011111011011011011100101111001111101010110011110101010101000100011101010010011101111111000001011011111011011011011100110011011101010101010001000111010100100111011111110000010110111110110110110111001011110011111010110001001000101 eaa88ea4efe0b7db6e66eaa88ea4efe0b7db6e5e7d59eaa88ea4efe0b7db6e66eaa88ea4efe0b7db6e5e7d6245
UTF-8 螯、閠頚nf螯、閠頚n^}Y螯、閠頚nf螯、閠頚n^}bE 11101000100111101010111111101111101111011010010011101001100101101010000011101001101000001001101001101110011001101110100010011110101011111110111110111101101001001110100110010110101000001110100110100000100110100110111001011110011111010101100111101000100111101010111111101111101111011010010011101001100101101010000011101001101000001001101001101110011001101110100010011110101011111110111110111101101001001110100110010110101000001110100110100000100110100110111001011110011111010110001001000101 e89eafefbda4e996a0e9a09a6e66e89eafefbda4e996a0e9a09a6e5e7d59e89eafefbda4e996a0e9a09a6e66e89eafefbda4e996a0e9a09a6e5e7d6245
UHC ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)