To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 歪????????亦↓?節ら?窈??傲??B 1001100001100011001111110011111100111111001111110011111100111111001111110011111110010110100100101000000110101011001111111001000011011111100000101110011100111111111000100111011100111111001111111001100011111100001111110011111101000010 98633f3f3f3f3f3f3f3f969281ab3f90df82e73fe2773f3f98fc3f3f42
EUC-JP 歪?????旿??亦↓?節ら?窈??傲??B 11001111110001000011111100111111001111110011111100111111100011111100000111110100001111110011111111001011111100101010001010101101001111111100000011100001101001001110100100111111111000111101100000111111001111111101000011111110001111110011111101000010 cfc43f3f3f3f3f8fc1f43f3fcbf2a2ad3fc0e1a4e93fe3d83f3fd0fe3f3f42
UTF-8 歪귨쉠樂됮죺旿⑵럦亦↓굚節ら썖窈붻쑊傲됪깹B 11100110101011011010101011101010101101111010100011101100100010011010000011101111101001101011111111101011100100001010111011101100101000111011101011100110100101111011111111100010100100011011010111101011100111111010011011100100101110101010011011100010100001101001001111101010101101011001101011100111101011111000000011100011100000101000100111101100100011011001011011100111101010101000100011101011101101101011101111101100100100011000101011100101100000101011001011101011100100001010101011101010101110011011100101000010 e6adaaeab7a8ec89a0efa6bfeb90aeeca3bae697bfe291b5eb9fa6e4baa6e28693eab59ae7af80e38289ec8d96e7aa88ebb6bbec918ae582b2eb90aaeab9b942
UHC 歪귨쉠樂됮죺旿⑵럦亦↓굚節ら썖窈붻쑊傲됪깹B 11101000111000001000001011101111101111011010101011101000111110011000100111101001101000011001010011100111111110101010100111101000100011101000100111100110101100101010000111101001100000101000001011101111101111011010101011101001100110111000100111101001101000011001010011101000100111001010100111100111111011001000100111100110101100101010000101000010 e8e082efbdaae8f989e9a194e7faa9e88e89e6b2a1e98282efbdaae99b89e9a194e89ca9e7ec89e6b2a142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)