Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 巽?+恁??猷ュ?巽?+恁??猷ュ?E 1001001001000110001111111000000101111011100111001000110000111111001111111001011101010001100000111000010100111111100100100100011000111111100000010111101110011100100011000011111100111111100101110101000110000011100001010011111101000101 92463f817b9c8c3f3f975183853f92463f817b9c8c3f3f975183853f45
EUC-JP 巽?+恁??猷ュ?巽?+恁??猷ュ?E 1100001110100111001111111010000111011100110101111110110000111111001111111100110110110010101001011110010100111111110000111010011100111111101000011101110011010111111011000011111100111111110011011011001010100101111001010011111101000101 c3a73fa1dcd7ec3f3fcdb2a5e53fc3a73fa1dcd7ec3f3fcdb2a5e53f45
UTF-8 巽숇+恁㎮뿀猷ュ늾巽숇+恁㎮뿀猷ュ꼐E 11100101101101111011110111101100100010001000011111101111101111001000101111100110100000011000000111100011100011101010111011101011101111111000000011100111100011001011011111100011100000111010010111101011100010101011111011100101101101111011110111101100100010001000011111101111101111001000101111100110100000011000000111100011100011101010111011101011101111111000000011100111100011001011011111100011100000111010010111101010101111001001000001000101 e5b7bdec8887efbc8be68181e38eaeebbf80e78cb7e383a5eb8abee5b7bdec8887efbc8be68181e38eaeebbf80e78cb7e383a5eabc9045
UHC 巽숇+恁㎮뿀猷ュ늾巽숇+恁㎮뿀猷ュ꼐E 11100001110111101001100111101011101000111010101111101100111101101010011111100010100101111000100011101011101000111010101111100101100010001000011111100001110111101001100111101011101000111010101111101100111101101010011111100010100101111000100011101011101000111010101111100101101100101011111001000101 e1de99eba3abecf6a7e29788eba3abe58887e1de99eba3abecf6a7e29788eba3abe5b2be45

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
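
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Return {charset label: (binary, hexadecimal)} for every listed charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("E").items():
        print(label, binary, hexadecimal)
```

For the single ASCII letter "E", every charset listed here yields the same byte, 0x45, which matches the final byte of each row in the table. Results for non-ASCII input may differ slightly from the page where Python's codecs and the page's converter disagree on edge cases.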