To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 偲示偲軸篠ハナ汐篠ヤトツナ辞偲而篠湿h 10001110110000111000111010100110100011101100001110001110101100101000111011000010110010101100010110001110101011001000111011000010110101001100010011000010110001011000111010101011100011101100001110001110101001111000111011000010100011101011110001101000 8ec38ea68ec38eb28ec2cac58eac8ec2d4c4c2c58eab8ec38ea78ec28ebc68
EUC-JP 偲示偲軸篠ハナ汐篠ヤトツナ辞偲而篠湿h 10111100110001011011110010101000101111001100010110111100101101001011110011000100100011101100101010001110110001011011110010101110101111001100010010001110110101001000111011000100100011101100001010001110110001011011110010101101101111001100010110111100101010011011110011000100101111001011111001101000 bcc5bca8bcc5bcb4bcc48eca8ec5bcaebcc48ed48ec48ec28ec5bcadbcc5bca9bcc4bcbe68
UTF-8 偲示偲軸篠ハナ汐篠ヤトツナ辞偲而篠湿h 11100101100000011011001011100111101001001011101011100101100000011011001011101000101110111011100011100111101011111010000011101111101111101000101011101111101111101000010111100110101100011001000011100111101011111010000011101111101111101001010011101111101111101000010011101111101111101000001011101111101111101000010111101000101111101001111011100101100000011011001011101000100000001000110011100111101011111010000011100110101110011011111101101000 e581b2e7a4bae581b2e8bbb8e7afa0efbe8aefbe85e6b190e7afa0efbe94efbe84efbe82efbe85e8be9ee581b2e8808ce7afa0e6b9bf68
UHC ?示?軸篠??汐篠??????而篠?h 0011111111100011110001100011111111110101111011101110000111000110001111110011111111100000101100011110000111000110001111110011111100111111001111110011111100111111111011001011101111100001110001100011111101101000 3fe3c63ff5eee1c63f3fe0b1e1c63f3f3f3f3f3fecbbe1c63f68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)